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0.1 Preface 

This is a proto-preface. A more complete preface will be written after these notes are completed. 

These lecture notes are intended to supplement a course in statistical physics at the upper division undergraduate 
or beginning graduate level. 

I was fortunate to learn this subject from one of the great statistical physicists of our time, John Cardy. 

I am grateful to my wife Joyce and to my children Ezra and Lily for putting up with all the outrageous lies I've 
told them about getting off the computer 'in just a few minutes' while working on these notes. 

These notes are dedicated to the only two creatures I know who are never angry with me: my father and my dog. 




Figure 1: My father (Louis) and my dog (Henry). 
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CHAPTER 1. PROBABILITY 



1.2 A Statistical View 



1.2.1 Distributions for a random walk 



Consider the mechanical system depicted in Fig. 1.1, a version of which is often sold in novelty shops. A ball 
is released from the top, which cascades consecutively through TV levels. The details of each ball's motion are 
governed by Newton's laws of motion. However, to predict where any given ball will end up in the bottom row is 
difficult, because the ball's trajectory depends sensitively on its initial conditions, and may even be influenced by 
random vibrations of the entire apparatus. We therefore abandon all hope of integrating the equations of motion 
and treat the system statistically. That is, we assume, at each level, that the ball moves to the right with probability 
p and to the left with probability q = 1 — p. If there is no bias in the system, then p = q = |. The position X N after 
N steps may be written 

JV 
3=1 

where d J r = +1 if the ball moves to the right at level j, and = —1 if the ball moves to the left at level j. At each 
level, the probability for these two outcomes is given by 

P,= P S^ +1 + qS^^\ P *° = + \ (1.2) 

I q if a = — 1 . 

This is a normalized discrete probability distribution of the type discussed in section 1.4 below. The multivariate 
distribution for all the steps is then 

JV 

V(a 1 ,...,a N ) = l[P(a j ). (1.3) 

3=1 

Our system is equivalent to a one-dimensional random walk. Imagine an inebriated pedestrian on a sidewalk 
taking steps to the right and left at random. After N steps, the pedestrian's location is X. 

Now let's compute the average of X: 

N 

= °i) = N ( a ) = N J2 a p ( a ) = N{ p - «) = n ( 2 p - !) ■ ( L4 ) 

j=l a=±l 

This could be identified as an equation of state for our system, as it relates a measurable quantity X to the number 
of steps TV and the local bias p. Next, let's compute the average of X 2 : 

N N 
3=1 j'=l 

Here we have used 



Note that (X 2 ) > (X) 2 , which must be so because 

Var(X) = ((AX) 2 ) = ((X (X)j 2 ) = (X 2 ) (X) 2 . (1.7) 

This is called the variance of X. We have Var(X) = ANp q. The root mean square deviation, AX ims , is the square root 
of the variance: AX lmB = ^/Var(X). Note that the mean value of X is linearly proportional to TV 1 , but the RMS 



1 The exception is the unbiased case p = q = |, where (X) =0. 
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••«••«•«•«• •••«•«••••*•• • JU 

— •••••• • __s 

• ^ 

4 ^^^l^ 
Figure 1.1: The falling ball system, which mimics a one-dimensional random walk. 



fluctuations AX rms are proportional to N 1 / 2 . In the limit AT — > oo then, the ratio Z\X rms / (X) vanishes as N~ x / 2 . 
This is a consequence of the central limit theorem (see §1.4.2 below), and we shall meet up with it again on several 
occasions. 

We can do even better. We can find the complete probability distribution for X. It is given by 

where N R/L are the numbers of steps taken to the right /left, with TV = N R + N L , and X = N R — N L . There are 
many independent ways to take N R steps to the right. For example, our first N R steps could all be to the right, and 
the remaining A^ L = N — N R steps would then all be to the left. Or our final A^ R steps could all be to the right. For 
each of these independent possibilities, the probability is p N ^ q N ^ . How many possibilities are there? Elementary 
combinatorics tells us this number is 

(1-9) 



(1.10) 



N J N 1 N ' ' 

Note that N ± X = 27V R/L , so we can replace N K/h = ±(N± X). Thus, 

AH 



P — 

1 N,X ( N+X \i ( N-X \\ 



p (N+X)/2 g (N-X)/2 



1.2.2 Thermodynamic limit 

Consider the limit N — > oo but with x = X/N finite. This is analogous to what is called the thermodynamic limit 
in statistical mechanics. Since N is large, x may be considered a continuous variable. We evaluate In P N x using 
Stirling's asymptotic expansion 

\nN\~N\nN -N + 0{\nN) . (1.11) 
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We then have 



\nP NiX ~NhiN - N -±N(l + x)\n ±iV(l + x) + ±iV(l+x) 

- |iV(l - x)ln [iiV(l-x)] + |iV(l - x) + |JV(1 + ar) lnp + ±iV(l - x) lng 



-JV 



(i±£)ln(l±£) + (l=£)ln(i=£) + TV (*±£) Inp + (i=2) lng 



(1.12) 



Notice that the terms proportional to TV In TV have all cancelled, leaving us with a quantity which is linear in TV. 
We may therefore write In P N x = —Nf(x) + C(ln TV), where 



/(*) = J(l±£) in (1±«) + (!=£) ]n (i=£)] - [(1±£) Inp + (±=£) lng 
We have just shown that in the large TV limit we may write 

N ,X — u e > 



(1.13) 



(1.14) 



where C is a normalization constant 2 . Since TV is by assumption large, the function P N x is dominated by the 
minimum (or minima) of /(x), where the probability is maximized. To find the minimum of f(x), we set f'(x) = 0, 
where 



«*)-*-(|-t^) 



Setting /' (x) = 0, we obtain 
We also have 

so invoking Taylor 's theorem, 

Putting it all together, we have 

P 



1 + x p 
1 — x q 



=>■ x = p — q 



/"(*) = 



l-x 2 



f(x) = m + y"(x) (x-x) 2 + 



C exp 



TV(x - x) 2 
&pq 



C exp 



8Npq 



(1.15) 

(1.16) 

(1.17) 
(1.18) 

(1.19) 



where X = (X) = N(p — q) = Nx. The constant C is determined by the normalization condition, 

(X-X) 2 ' 



P N-X " 

X=-oo 



\\ dXC exp 



8Npq 



s/VnNpq C 



(1.20) 



and thus C = 1/ \f2nNpq. Why don't we go beyond second order in the Taylor expansion of f(x)7 We will find 
out in 81.4.2 below. 



2 The origin of C lies in the 0(ln N) and O(N ) terms in the asymptotic expansion of In AM. We have ignored these terms here. Accounting 
for them carefully reproduces the correct value of C in eqn. 1.20. 
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Figure 1.2: Comparison of exact distribution of eqn. 1.10 (red squares) with the Gaussian distribution of eqn. 1.19 
(blue line). 



1.2.3 Entropy and energy 

The function f(x) can be written as a sum of two contributions, f(x) = e(x) — s(x), where 

s(x) = _(i± £ ) ln (i± £ )_(x-) ln (l-) 
e(x) = -\\n{pq) ~ \x\n(p/q) . 

The function S(N, x) = Ns(x) is analogous to the statistical entropy of our system 3 . We have 



(1.21) 



S(N, X )=Ns(x) = ^y^ N » +x) ). (1.22) 

Thus, the statistical entropy is the logarithm of the number of ways the system can be configured so as to yield the same value 
of X (at fixed N). The second contribution to f(x) is the energy term. We write 

E{N,x) = Ne(x) = -\N\n{pq) - \Nx\n(p/q) . (1.23) 

The energy term biases the probability P N x = exp(5 — E) so that low energy configurations are more probable than 
high energy configurations. For our system, we see that when p < q (i.e. p < |), the energy is minimized by taking x 
as small as possible (meaning as negative as possible). The smallest possible allowed value of x = X/N is x = — 1. 
Conversely, when p > q (i.e. p > i), the energy is minimized by taking x as large as possible, which means x = 1. 
The average value of x, as we have computed explicitly, is x = p — q = 2p — 1, which falls somewhere in between 
these two extremes. 

In actual thermodynamic systems, as we shall see, entropy and energy are not dimensionless. What we have 
called S here is really S/k B , which is the entropy in units of Boltzmann's constant. And what we have called E 
here is really E/k B T, which is energy in units of Boltzmann's constant times temperature. 



3 The function s(x) is the specific entropy. 
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1.2.4 Entropy and information theory 

It was shown in the classic 1948 work of Claude Shannon that entropy is in fact a measure of information 4 . Suppose 
we observe that a particular event occurs with probability p. We associate with this observation an amount of 
information I(p). The information I(p) should satisfy certain desiderata: 

1 Information is non-negative, i.e. I(p) > 0. 

2 If two events occur independently so their joint probability is p x p 2 , then their information is additive, i.e. 
liPiPz) =/(p x )+/(Pa). 

3 I(p) is a continuous function of p. 

4 There is no information content to an event which is always observed, i.e. 1(1) = 0. 

From these four properties, it is easy to show that the only possible function I(p) is 

I(p) = -Ahip, (1.24) 

where A is an arbitrary constant that can be absorbed into the base of the logarithm, since log b x = In xj In b. We 
will take A = 1 and use e as the base, so I(p) = — In p. Another common choice is to take the base of the logarithm 
to be 2, so I(p) — — log 2 p. In this latter case, the units of information are known as bits. Note that 1(0) = oo. This 
means that the observation of an extremely rare event carries a great deal of information. 

Now suppose we have a set of events labeled by an integer n which occur with probabilities {p n }. What is 
the expected amount of information in N observations? Since event n occurs an average of Np n times, and the 
information content in p n is — hip n , we have that the average information per observation is 

n 

which is known as the entropy of the distribution. Thus, maximizing S is equivalent to maximizing the information 
content per observation. 

Consider, for example, the information content of course grades. As we have seen, if the only constraint on the 
probability distribution is that of overall normalization, then S is maximized when all the probabilities p n are 
equal. The binary entropy is then S = log 2 T, since p n = 1/-T. Thus, for pass/fail grading, the maximum average 
information per grade is — log 2 (|) = log 2 2=1 bit. If only A, B, C, D, and F grades are assigned, then the 
maximum average information per grade is log 2 5 = 2.32 bits. If we expand the grade options to include {A+, A, 
A-, B+, B, B-, C+, C, C-, D, F}, then the maximum average information per grade is log 2 11 = 3.46 bits. 

Equivalently, consider, following the discussion in vol. 1 of Kardar, a random sequence {n ll n 2 , ■ ■ ■ , n N } where 
each element takes one of K possible values. There are then K N such possible sequences, and to specify one of 
them requires \og 2 (K N ) = N log 2 K bits of information. However, if the value n occurs with probability p n , then 
on average it will occur N n = Np n times in a sequence of length N, and the total number of such sequences will 
be 

N\ 

g(N) = . (1.26) 

TT , N ! 

In general, this is far less that the total possible number K N , and the number of bits necessary to specify one from 
among these g(N) possibilities is 

K K 

\og 2 g(N) = log 2 (7V!) -^log 2 (iV n !) « -iV^p„log 2 p„ , (1.27) 

n— 1 n—1 



4 See 'An Introduction to Information Theory and Entropy' by T. Carter, Santa Fe Complex Systems Summer School, June 2011. Available 
online at http:/ / astarte.csustan.edu/ tom/SFI-CSSS/info-theory/info-lec.pdf. 
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where we have invoked Stirling's approximation. If the distribution is uniform, then we have p n = -i for all 

7i € {1, ... , K}, and log 2 g(N) = N log 2 K. 



1.3 Probability Distributions from Maximum Entropy 

We have shown how one can proceed from a probability distribution and compute various averages. We now 
seek to go in the other direction, and determine the full probability distribution based on a knowledge of certain 
averages. 

At first, this seems impossible. Suppose we want to reproduce the full probability distribution for an TV-step 
random walk from knowledge of the average (X) = (2p — 1)N. The problem seems ridiculously underdeter- 
mined, since there are 2 N possible configurations for an TV-step random walk: = ±1 for j = 1, . . . , N. Overall 
normalization requires 

^P(a l! ... !< 7 JV ) = l ! (1.28) 

but this just imposes one constraint on the 2 N probabilities P{<J l , . . . , cr N ), leaving 2 N — 1 overall parameters. What 
principle allows us to reconstruct the full probability distribution 

N N 

P{a u . . . ,a N ) = H (P^,i +QS i) = nP (1+ ^ )/2 <Z (1 ^ )/2 , d-29) 
corresponding to TV independent steps? 

1.3.1 The principle of maximum entropy 

The entropy of a discrete probability distribution {p n } is defined as 

S=-J2Pn^Pn, (1-30) 

n 

where here we take e as the base of the logarithm. The entropy may therefore be regarded as a function of the 
probability distribution: S = S ({p n }) ■ One special property of the entropy is the following. Suppose we have two 
independent normalized distributions {p^ } and {p^ } . The joint probability for events a and b is then P ab = p^Pb- 
The entropy of the joint distribution is then 

a b a b a b 

= -Y,Pa ■ Erf - Erf ln rf • Erf = - Erf ln rf - Erf ln rf 

a b b a a b 

= s A + s B . 

Thus, the entropy of a joint distribution formed from two independent distributions is additive. 

Suppose all we knew about {p n } was that it was normalized. Then P n = 1- This is a constraint on the values 
{p n }. Let us now extremize the entropy S with respect to the distribution {p n }, but subject to the normalization 
constraint. We do this using Lagrange's method of undetermined multipliers. We define 



S*({Pnh A ) = -E p » ln?7 » ~ A (E P « ~ X ) 

71 n 



(1.31) 
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and we freely extremize S* over all its arguments. Thus, for all n we have 

— = -(\np n + l + \) =0 (1.32) 

as well as 

n 

From the first of these equations, we obtain p n = e~^ x \ and from the second we obtain 

^ Plt = e-^.^l = re-(W), (1.34) 

n n 

where r = 1 is the total number of possible events. Thus, 



Pn = Ji, (1-35) 



which says that all events are equally probable. 



Now suppose we know one other piece of information, which is the average value of some quantity X = 
J2 n X n P n - We now extremize S subject to two constraints, and so we define 

S*({pJ,\ ,\ 1 )=-Y^Pn^Pn-*o(j2Pn- 1 )- X l{Y, X nPn-X) • (1-36) 

n n n 

We then have 

OS* 

— = -(\n Pn + l + \ Q + \ 1 X n ) = , (1.37) 
which yields the two-parameter distribution 

p n = e -( 1+A o) e - A i x - . (1.38) 

To fully determine the distribution {p n } we need to invoke the two equations J2 n Pn = 1 anc ^ J2 n X nPn = x > 
which come from extremizing S* with respect to A and X lr respectively: 

e -(l+A„)^ e -A 1 X„ = 1 (139) 
n 

e -(i+A ) X n e~ A i X " = X . (1.40) 

n 

General formulation 

The generalization to K extra pieces of information (plus normalization) is immediately apparent. We have 

X a = ^KPn, (1.41) 

n 

and therefore we define 

K 

S*({PJ,{K}) =-J2Pn^Pn-J2 X «{Y, X nPn-X a ) , fl-42) 

n a— n 
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with xli °^ = x( a ~ ^ = 1. Then the optimal distribution which extremizes S subject to the K + 1 constraints is 



(1.43) 



where Z = e 1+x « is determined by normalization: ^2 n p n = 1- This is a (A' + l)-parameter distribution, with 
{A , X 1 , . . . , X K } determined by the K + 1 constraints in eqn. 1.41. 

Example 

As an example, consider the random walk problem. We have two pieces of information: 

^■•■^P(a 1 ,...,^) = l (1.44) 

N 

E---E / >i'---'^)E fT . = x - (i-^s) 

Here the discrete label n from §1.3.1 ranges over 2 N possible values, and may be written as an iV digit binary 
number r N ■ ■ ■ r l7 where n = |(1 + <Jj) is or 1. Extremizing S subject to these constraints, we obtain 

P(a 1 ,...,a N )=Ccxpl-\J2v J ) =CY[e- x °> , (1.46) 

I 3 > 3=1 

where C = e~( 1+A °) and A = A 2 . Normalization then requires 

TrP = =C(e x + e~ x ) N , (1.47) 

hence C = (cosh X)~ N . We then have 

P(a u ,a N ) = J] -X--X = II (*> V + - d-48) 

3=1 ' ' 3=1 

where 

q=l-p= , _. . (1.49) 



We then have X = (2p — 1)^, which determines p = i(iV + X), and we have recovered the correct distribution. 
1.3.2 Continuous probability distributions 

Suppose we have a continuous probability density -P(v) defined over some set 0. We have observables 

X a = [dfiX a (<p)P(<p) , (1.50) 
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where dp is the appropriate integration measure. We assume d/i = Y\f=i d<Pj> where D is the dimension of 0. 
Then we extremize the functional 

S*[P(<p),{\ a }] = [dpP^lnP^-^xJ fdpP(<p)X a (<p)-X") (1-51) 



with respect to -P(y) and with respect to {A a }. Again, X°(ip) = X° = 1. This yields the following result: 



A" 



\nP( V >) = -l-J2KX a (<f)- (1-52) 

a=0 

The K + 1 Lagrange multipliers {A a } are then determined from the K + 1 constraint equations in eqn. 1.50. 
As an example, consider a distribution P(x) over the real numbers R. We constrain 



(1.53) 



— oo — oo — oo 

Extremizing the entropy, we then obtain 

P(:r) ^Ce^ 1 *"^ 2 , (1.54) 
where C = e~( 1+A °). We already know the answer: 

P(X) = -J=L= e -(^A l ) 2 /2- 2 , (L55) 



In other words, X ± = -fi/a 2 and A 2 = l/2cr 2 , with C = /2<T /V2ira 2 . 

1.4 General Aspects of Probability Distributions 

1.4.1 Discrete and continuous distributions 

Consider a system whose possible configurations | n ) can be labeled by a discrete variable n E C, where C is the 
set of possible configurations. The total number of possible configurations, which is to say the order of the set C, 
may be finite or infinite. Next, consider an ensemble of such systems, and let P n denote the probability that a 
given random element from that ensemble is in the state (configuration) \n). The collection {P n } forms a discrete 
probability distribution. We assume that the distribution is normalized, meaning 

E P « = 1 - (1-56) 

Now let A n be a quantity which takes values depending on n. The average of A is given by 

(A) = Y / P n A n . (1.57) 

n£C 

Typically, C is the set of integers (Z) or some subset thereof, but it could be any countable set. As an example, 
consider the throw of a single six-sided die. Then P n — i for each n G {1, . . . , 6}. Let A n = if n is even and 1 if n 
is odd. Then find (A) = |, i.e. on average half the throws of the die will result in an even number. 
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It may be that the system's configurations are described by several discrete variables {n 11 n 2 , n 3 , . . .}. We can 
combine these into a vector n and then we write P n for the discrete distribution, with '}2 in P n = 1. 

Another possibility is that the system's configurations are parameterized by a collection of continuous variables, 
(p = {ip 1 , . . . , <p n }. We write ip G fi, where il is the phase space (or configuration space) of the system. Let d/i be a 
measure on this space. In general, we can write 

d/i = W(ifi, ...,tp n ) dip! dip 2 --- dip n . (1.58) 

The phase space measure used in classical statistical mechanics gives equal weight W to equal phase space vol- 
umes: 

r 

dft = C U dq a d Va , (1.59) 

cr=l 

where C is a constant we shall discuss later on below 5 . 

Any continuous probability distribution P(<p) is normalized according to 

fdfiP{<p) = l. (1.60) 



n 

The average of a function A(cp) on configuration space is then 



(A) = Jd^P{ v )A( v ). (1.61) 

n 

For example, consider the Gaussian distribution 

p(x) = 1 e -^-M) 2 /2<r 2 _ ( L62 ) 
V27TO" 2 

From the result 6 

c 

^e- ai2 e^ = J-e^ 4a , (1.63) 



(1.64) 



we see that P{x) is normalized. One can then compute 

(x) = n 

We call /Lt the mean and a the standard deviation of the distribution, eqn. 1.62. 

The quantity P(<p) is called the distribution or probability density. One has 

P{tp) dfi = probability that configuration lies within volume d/i centered at (p 

For example, consider the probability density P = 1 normalized on the interval x e [0, l] . The probability that 
some x chosen at random will be exactly |, say, is infinitesimal - one would have to specify each of the infinitely 
many digits of x. However, we can say that x e [0.45 , 0.55] with probability j^. 



5 Such a measure is invariant with respect to canonical transformations, which are the broad class of transformations among coordinates 
and momenta which leave Hamilton's equations of motion invariant, and which preserve phase space volumes under Hamiltonian evolution. 
For this reason dfi is called an invariant phase space measure. See the discussion in appendix II of chapter 4. 

6 Memorize this! 
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If x is distributed according to P 1 (x), then the probability distribution on the product space [x 1 , x 2 ) is simply the 
product of the distributions: 

P 2 (x 1 ,x 2 )=P 1 (x 1 )P 1 (x 2 ). (1.65) 
Suppose we have a function (j)(x 1 , . . . , x N ). How is it distributed? Let Q(<p) be the distribution for cj). We then have 



oo oo 



V{4>) = jdx 1 --- j dx N P N (x 1 , . . .,x N )s(j>(x 1 , . . .,x N ) - 

(1.66) 

= dx 1 --- x N P 1 (x 1 )---P 1 (x N )s(j)(x 1 ,...,x N )-())j , 



-OO — OO 



where the second line is appropriate if the {xj} are themselves distributed independently. Note that 



■P(0 = 1. (1-67) 



so Vi^) is itself normalized. 



1.4.2 Central limit theorem 

In particular, consider the distribution function of the sum 



N 



X = J2^- (1-68) 



i=i 



We will be particularly interested in the case where N is large. For general N, though, we have 



oo oo 



V N {X)= Jdx x --- Jdx N P 1 (x 1 )---P 1 (x N )5(x 1 +x 2 + ... + x N -X) . (1.69) 



— oo — oo 
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It is convenient to compute the Fourier transform 7 of V{X): 

oo 

V N (k) = [dXV N (X)e- lkx (1.70) 



-OO 

OO OO oo 

<dXj dXl ...[ x N P^xj) ■ ■ ■ P^Xjf) 5(xj_ + ... + x N -X) e~ lkx 

-oo — OO — OO 

I N 



= [Pi(k)Y 
where 

oo 

P 1 (k) = fdasP 1 (x)e- ihB (1.71) 



is the Fourier transform of the single variable distribution Pi(x). The distribution V N (X) is a convolution of the 
individual P 1 (.tJ distributions. We have therefore proven that the Fourier transform of a convolution is the product of 
the Fourier transforms. 

OK, now we can write for P 1 (k) 

oo 

P x (fc) = jdxP r {x)(l-ikx- \k 2 x 2 + \ik 3 x 3 + ...) 

J (1.72) 

— oo 

= 1 - ik(x) - A k 2 (x 2 ) + I i k 3 (x 3 ) + ... . 



Thus, 



where 



In Pj (k) = -ink - \a 2 k 2 + \ i 7 3 k 3 + . . . , (1.73) 



/! = {x) 

a 2 = (x 2 ) (x) 2 (1.74) 
7 3 = (x 3 ) - 3 (x 2 ) (x) + 2 (x) 3 



We can now write 

1 N 



P^k)]" = e- m » k e~ N ° k I 2 e iN ^ k / 6 ■ ■ ■ (1.75) 



7 Jean Baptiste Joseph Fourier (1768-1830) had an illustrious career. The son of a tailor, and orphaned at age eight, Fourier's ignoble status 
rendered him ineligible to receive a commission in the scientific corps of the French army. A Benedictine minister at the Ecole Royale Militaire 
of Auxerre remarked, "Fourier, not being noble, could not enter the artillery, although he were a second Newton. " Fourier prepared for the priesthood, 
but his affinity for mathematics proved overwhelming, and so he left the abbey and soon thereafter accepted a military lectureship position. 
Despite his initial support for revolution in France, in 1794 Fourier ran afoul of a rival sect while on a trip to Orleans and was arrested and very 
nearly guillotined. Fortunately the Reign of Terror ended soon after the death of Robespierre, and Fourier was released. He went on Napoleon 
Bonaparte's 1798 expedition to Egypt, where he was appointed governor of Lower Egypt. His organizational skills impressed Napoleon, and 
upon return to France he was appointed to a position of prefect in Grenoble. It was in Grenoble that Fourier performed his landmark studies 
of heat, and his famous work on partial differential equations and Fourier series. It seems that Fourier's fascination with heat began in Egypt, 
where he developed an appreciation of desert climate. His fascination developed into an obsession, and he became convinced that heat could 
promote a healthy body. He would cover himself in blankets, like a mummy, in his heated apartment, even during the middle of summer. 
On May 4, 1830, Fourier, so arrayed, tripped and fell down a flight of stairs. This aggravated a developing heart condition, which he refused 
to treat with anything other than more heat. Two weeks later, he died. Fourier's is one of the 72 names of scientists, engineers and other 
luminaries which are engraved on the Eiffel Tower. Source: http:/ /www.robertnowlan.com/pdfs/Fourier,%20Joseph.pdf 
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Now for the inverse transform. In computing V N (X), we will expand the term e lN i 3 k 3 /6 anc j a v{ subsequent terms 
in the above product as a power series in k. We then have 



oo 

dk 



Vn ( X ) = j g eW-m e -N« 2 ^/2 | x + 1 i Nl 3 k 3 + | 

— OO 

_ I A^3 , ^ 1 e -(X-Nrf/2N^ (1-76) 

= 1 e -&-Ntf/2No» (TV — > oo) . 

In going from the second line to the third, we have written X = y/~N £, in which case 

N ^ = g f / w hich 

gives a subleading contribution which vanishes in the N — > oo limit. We have just proven the central limit theorem: 
in the limit TV — ► oo, the distribution of a sum of N independent random variables x i is a Gaussian with mean Nfj, 
and standard deviation ^/N a. Our only assumptions are that the mean \i and standard deviation a exist for the 
distribution P 1 (x). Note that P x (x) itself need not be a Gaussian - it could be a very peculiar distribution indeed, 
but so long as its first and second moment exist, where the fc th moment is simply (x k ), the distribution of the sum 
X = Yli=i x i is a Gaussian. 



1.4.3 Multidimensional Gaussian integral 

Consider the multivariable Gaussian distribution, 

1 /2 

P{X) = {jS^J ° XP ( " 2 Xl Aj X >) ' (L77) 

where A is a positive definite matrix of rank n. A mathematical result which is extremely important throughout 
physics is the following: 

1/2 00 00 

Z ^ = (^ItW fa* 1 ' " J dXn 6XP ( ~ ^ ^ + 5l Xi ) = ° XP 6j A ^ bj ) ' ^' 78 ^ 

— oo — oo 

Here, the vector b = (b 1} ... , b n ) is identified as a source. Since Z(0) = 1, we have that the distribution P{x) is 
normalized. Now consider averages of the form 



(1.79) 

6=0 



(x h --- Xj J= ld n xP(x)x 3 
d n Z(b) 



db- ■■■ <% ■ 

Jl J2 



^— ' J e T(l)J e r(2) J e T(2fc-l)-'cr(2 



contractions 



The sum in the last term is over all contractions of the indices {j x , ... , j 2 fci- A contraction is an arrangement of 
the 2fc indices into k pairs. There are C 2k = (2k)\/2 k kl possible such contractions. To obtain this result for C k , 
we start with the first index and then find a mate among the remaining 2k — 1 indices. Then we choose the next 
unpaired index and find a mate among the remaining 2k — 3 indices. Proceeding in this manner, we have 

C 2k = (2fc - 1) ■ (2ft - 3). -3. 1 = M . (1.80) 



1.5. APPENDIX: BAYESIAN STATISTICS 
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Equivalently, we can take all possible permutations of the 2k indices, and then divide by 2 k kl since permuta- 
tion within a given pair results in the same contraction and permutation among the k pairs results in the same 
contraction. For example, for k = 2, we have C 4 = 3, and 

(a?, a?, x j x, )=A7) Aj\ + At) Aj) + Aj) Aj 1 . (1.81) 



1.5 Appendix : Bayesian Statistics 

Let the probability of a discrete event A be P(j4). We now introduce two additional probabilities. The joint 
probability for events A and B together is written P(A n B). The conditional probability of P given A is P(B|j4). We 
can compute the joint probability P(A (1 B) = P(B n A) in two ways: 

P(A HB) = P{A\B) ■ P(B) = P{B\A) ■ P(A) . (1.82) 

Thus, 

*W = «P • (1-83) 
a result known as Bayes' theorem. Now suppose the 'event space' is partitioned as {A { }. Then 

P(B)=Y / P(B\A l )-P(A l ). (1.84) 

We then have 

_ P(B|A t )-P(A t ) 

a result sometimes known as the extended form of Bayes' theorem. When the event space is a 'binary partition' 
{A, ^A}, we have 

PM im_ P(B\A)-P(A) 

F{AlB > ~ P{B\A) ■ P(A) + P(B\^A) ■ PirA) ■ ( 86) 

Note that P(A\B) + P(^A\B) = 1 (which follows from ^A = A). 

As an example, consider the following problem in epidemiology. Suppose there is a rare but highly contagious 
disease A which occurs in 0.01% of the general population. Suppose further that there is a simple test for the 
disease which is accurate 99.99% of the time. That is, out of every 10,000 tests, the correct answer is returned 9,999 
times, and the incorrect answer is returned only once 8 . Now let us administer the test to a large group of people 
from the general population. Those who test positive are quarantined. Question: what is the probability that 
someone chosen at random from the quarantine group actually has the disease? We use Bayes' theorem with the 
binary partition {A, ~^A}. Let B denote the event that an individual tests positive. Anyone from the quarantine 
group has tested positive. Given this datum, we want to know the probability that that person has the disease. 
That is, we want P(A\B). Applying eqn. 1.86 with 

P(A) = 0.0001 , P(->A) = 0.9999 , P{B\A) = 0.9999 , P(B\->A) = 0.0001 , 

we find P{A\B) = |. That is, there is only a 50% chance that someone who tested positive actually has the disease, 
despite the test being 99.99% accurate! The reason is that, given the rarity of the disease in the general population, 
the number of false positives is statistically equal to the number of true positives. 



8 Epidemiologists define the sensitivity of a binary classification test as the fraction of actual positives which are correctly identified, and 
the specificity as the fraction of actual negatives that are correctly identified. In our example in the text, the sensitivity and specificity are both 
0.9999. 
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For continuous distributions, we speak of a probability density. We then have 



P(y)= Ida P(y\x) ■ P(x) 



(1.87) 



and 



P(x\y) 



P(y\x) ■ P(x) 



(1.88) 



Jdx' P(y\x') ■ P(x') ' 



The range of integration may depend on the specific application. 

The quantities P(AA are called the prior distribution. Clearly in order to compute P{B) or P{A i \B) we must know 
the priors, and this is usually the weakest link in the Bayesian chain of reasoning. If our prior distribution is not 
accurate, Bayes' theorem will generate incorrect results. One approach to obtaining the prior probabilities P(A,i) 
is to obtain them from a maximum entropy construction. 



Chapter 2 



Thermodynamics 
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CHAPTER 2. THERMODYNAMICS 



2.2 What is Thermodynamics? 

Thermodynamics is the study of relations among the state variables describing a thermodynamic system, and of 
transformations of heat into work and vice versa. 



2.2.1 Thermodynamic systems and state variables 

Thermodynamic systems contain large numbers of constituent particles, and are described by a set of state variables 
which describe the system's properties in an average sense. State variables are classified as being either extensive 
or intensive. 

Extensive variables, such as volume V, particle number N, total internal energy E, magnetization M, etc., scale 
linearly with the system size, i.e. as the first power of the system volume. If we take two identical thermodynamic 
systems, place them next to each other, and remove any barriers between them, then all the extensive variables 
will double in size. 

Intensive variables, such as the pressure p, the temperature T, the chemical potential /x, the electric field E, etc., are 
independent of system size, scaling as the zeroth power of the volume. They are the same throughout the system, 
if that system is in an appropriate state of equilibrium. The ratio of any two extensive variables is an intensive 
variable. For example, we write n = N/V for the number density, which scales as V°. Intensive variables may 
also be inhomogeneous. For example, n(r) is the number density at position r, and is defined as the limit of AN/ AV 
of the number of particles AiV inside a volume AV^ which contains the point r, in the limit V 3> Ay S> V/N. 

Classically, the full motion of a system of TV point particles requires 6N variables to fully describe it (3N positions 
and 3 velocities or momenta, in three space dimensions) 1 . Since the constituents are very small, N is typically 
very large. A typical solid or liquid, for example, has a mass density on the order of q ~ 1 g/cm 3 ; for gases, 
q ~ 1CP 3 g/cm 3 . The constituent atoms have masses of 10° to 10 2 grams per mole, where one mole of X contains 
iV A of X, and N A = 6.02 2 1 4 1 5 x 10 23 is Avogadro's number. Thus, for solids and liquids we roughly expect 
number densities n of 1CP 2 — 10°mol/cm 3 for solids and liquids, and 1CP 5 — 10 _3 mol/cm 3 for gases. Clearly 
we are dealing with fantastically large numbers of constituent particles in a typical thermodynamic system. The 
underlying theoretical basis for thermodynamics, where we use a small number of state variables to describe a 
system, is provided by the microscopic theory of statistical mechanics, which we shall study in the weeks ahead. 

Intensive quantities such as p, T, and n ultimately involve averages over both space and time. Consider for 
example the case of a gas enclosed in a container. We can measure the pressure (relative to atmospheric pressure) 
by attaching a spring to a moveable wall, as shown in Fig. 2.2. From the displacement of the spring and the value 
of its spring constant k we determine the force F. This force is due to the difference in pressures, so p = p Q + F/A. 
Microscopically, the gas consists of constituent atoms or molecules, which are constantly undergoing collisions 
with each other and with the walls of the container. When a particle bounces off a wall, it imparts an impulse 
2n(n • p), where p is the particle's momentum and n is the unit vector normal to the wall. (Only particles with 
p ■ n > will hit the wall.) Multiply this by the number of particles colliding with the wall per unit time, and one 
finds the net force on the wall; dividing by the area gives the pressure p. Within the gas, each particle travels for 
a distance £, called the mean free path, before it undergoes a collision. We can write £ = vt, where v is the average 
particle speed and r is the mean free time. When we study the kinetic theory of gases, we will derive formulas 
for I and v (and hence r). For now it is helpful to quote some numbers to get an idea of the relevant distance 
and time scales. For 2 gas at standard temperature and pressure (T = 0° C, p = 1 atm), the mean free path is 
I m 1.1 X 10~ 5 cm, the average speed is v w 480 m/s, and the mean free time is r « 2.5 x 10~ 10 s. Thus, particles in 
the gas undergo collisions at a rate r _1 ps 4.0 x 10 9 s _1 . A measuring device, such as our spring, or a thermometer, 

1 For a system of N molecules which can freely rotate, we must then specify 3N additional orientational variables - the Euler angles - and 
their 37V conjugate momenta. The dimension of phase space is then 12N. 
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individual particles 
classical mechanics 

variables: {x i , p^} 



statistical mechanics 



gases, liquids, solids 
thermodynamics 

variables: T,p,V, . . . 



individual people human populations 

psychology sociology/politics 

variables: ??? variables: ??? 

Figure 2.1: From microscale to macroscale : physical versus social sciences. 

effectively performs time and space averages. If there are -ZV C collisions with a particular patch of wall during some 
time interval on which our measurement device responds, then the root mean square relative fluctuations in the 

— 1/2 

local pressure will be on the order of N c ' times the average. Since N c is a very large number, the fluctuations 
are negligible. 

If the system is in steady state, the state variables do not change with time. If furthermore there are no macroscopic 
currents of energy or particle number flowing through the system, the system is said to be in equilibrium. A 
continuous succession of equilibrium states is known as a thermodynamic path, which can be represented as a 
smooth curve in a multidimensional space whose axes are labeled by state variables. A thermodynamic process 
is any change or succession of changes which results in a change of the state variables. In a cyclic process, the 
initial and final states are the same. In a quasistatic process, the system passes through a continuous succession of 
equilibria. A reversible process is one where the external conditions and the thermodynamic path of the system can 
be reversed (at first this seems to be a tautology). All reversible processes are quasistatic, but not all quasistatic 
processes are reversible. For example, the slow expansion of a gas against a piston head, whose counter-force is 
always infinitesimally less than the force pA exerted by the gas, is reversible. To reverse this process, we simply 
add infinitesimally more force to pA and the gas compresses. A quasistatic process which is not reversible: slowly 
dragging a block across the floor, or the slow leak of air from a tire. Irreversible processes, as a rule, are dissipative. 
Other special processes include isothermal (dT = 0) isobaric (dp = 0), isochoric (dV = 0), and adiabatic (dQ = 0, 
i.e. no heat exchange): 

reversible: dQ = T dS 
spontaneous: dQ < T dS 
adiabatic: dQ = 
quasistatic: infinitely slowly 

We shall discuss later the entropy S and its connection with irreversibility. 

How many state variables are necessary to fully specify the equilibrium state of a thermodynamic system? For 
a single component system, such as water which is composed of one constituent molecule, the answer is three. 
These can be taken to be T, p, and V. One always must specify at least one extensive variable, else we cannot 
determine the overall size of the system. For a multicomponent system with g different species, we must specify 
g + 2 state variables, which may be {T,p, N t ,..., N g }, where N a is the number of particles of species a. Another 
possibility is the set (T,p, V,x 1 ,... , x g _ 1 }, where the concentration of species a is x a = N a /N. Here, N = J2 9 a =i N a 
is the total number of particles. Note that J21=i x a = 1- 

If then follows that if we specify more than g + 2 state variables, there must exist a relation among them. Such 
relations are known as equations of state. The most famous example is the ideal gas law, 



isothermal: dT = 
isochoric: dV = 
isobaric: dp = 



pV = Nk B T , 



(2.1) 
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10" a S 



P-Po = F/A 



Figure 2.2: The pressure p of a gas is due to an average over space and time of the impulses due to the constituent 
particles. 



relating the four state variables T, p, V, and N. Here fc B = 1.3806503 x 10 16 erg/K is Boltzmann's constant. 
Another example is the van der Waals equation, 

aN 2 ' 



P- 



(V - bN) = Nk B T , 



(2.2) 



where a and b are constants which depend on the molecule which forms the gas. For a third example, consider a 
paramagnet, where 

^ = ^ , (2.3) 
V T 

where M is the magnetization, H the magnetic field, and C the Curie constant. 

Any quantity which, in equilibrium, depends only on the state variables is called a state function. For example, 
the total internal energy E of a thermodynamics system is a state function, and we may write E = E(T,p, V). 
State functions can also serve as state variables, although the most natural state variables are those which can be 
directly measured. 



2.2.2 Heat 

Once thought to be a type of fluid, heat is now understood in terms of the kinetic theory of gases, liquids, and 
solids as a form of energy stored in the disordered motion of constituent particles. The units of heat are therefore 
units of energy, and it is appropriate to speak of heat energy, which we shall simply abbreviate as heat'? 

1J = 10 7 erg = 6.242 x 10 18 eV = 2.390 x 10~ 4 kcal = 9.478 x 10~ 4 BTU . (2.4) 

We will use the symbol Q to denote the amount of heat energy absorbed by a system during some given ther- 
modynamic process, and dQ to denote a differential amount of heat energy. The symbol d indicates an 'inexact 
differential', about which we shall have more to say presently. This means that heat is not a state function: there 
is no 'heat function' Q(T,p, V). 

2 One calorie (cal) is the amount of heat needed to raise 1 g of H a O from T = 14.5° C to T ± = 15.5° C at a pressure of p = 1 atm. One 
British Thermal Unit (BTU) is the amount of heat needed to raise 1 lb. of H a O from T = 63° F to T x = 64° F at a pressure of p = 1 atm. 
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2.2.3 Work 

In general we will write the differential element of work dW done by the system as 

dW = Y J F l dX l , (2.5) 

i 

where F i is a generalized force and dX i a generalized displacement 3 . The generalized forces and displacements are 
themselves state variables, and by convention we will take the generalized forces to be intensive and the general- 
ized displacements to be extensive. As an example, in a simple one-component system, we have dW = p dV. More 
generally, we write 

- Ej Vj dX j E a M a dN a 

, A > , * > 

<JW = (pdF - If ■ dM - E ■ dP - odA + . . . ) - diVj + p 2 dN 2 + ...) (2.6) 

Here we distinguish between two types of work. The first involves changes in quantities such as volume, mag- 
netization, electric polarization, area, etc. The conjugate forces y i applied to the system are then — p, the magnetic 
field H, the electric field E, the surface tension a, respectively. The second type of work involves changes in the 
number of constituents of a given species. For example, energy is required in order to dissociate two hydrogen 
atoms in an H 2 molecule. The effect of such a process is dN H = — 1 and dN H = +2. 

As with heat, dW is an inexact differential, and work W is not a state variable, since it is path-dependent. There is 
no 'work function' W(T,p, V). 

2.2.4 Pressure and Temperature 

The units of pressure (p) are force per unit area. The SI unit is the Pascal (Pa): 1 Pa = 1 N/m 2 = 1 kg/ms 2 . Other 
units of pressure we will encounter: 

1 bar = 10 5 Pa 
latm = 1.01325 x 10 5 Pa 
ltorr = 133.3 Pa. 

Temperature (T) has a very precise definition from the point of view of statistical mechanics, as we shall see. Many 
physical properties depend on the temperature - such properties are called thermometric properties. For example, 
the resistivity of a metal p(T,p) or the number density of a gas n(T,p) are both thermometric properties, and can 
be used to define a temperature scale. Consider the device known as the 'constant volume gas thermometer' 
depicted in Fig. 2.3, in which the volume or pressure of a gas may be used to measure temperature. The gas 
is assumed to be in equilibrium at some pressure p, volume V, and temperature T. An incompressible fluid of 
density q is used to measure the pressure difference Ap = p — p Q , where p is the ambient pressure at the top of 
the reservoir: 

P-Po = Q9{h 2 - K) > ( 2 - 7 ) 

where g is the acceleration due to gravity. The height h 1 of the left column of fluid in the U-tube provides a 
measure of the change in the volume of the gas: 

V(h 1 ) = V(0)-Ah 1 , (2.8) 

where A is the (assumed constant) cross-sectional area of the left arm of the U-tube. The device can operate in two 
modes: 



3 We use the symbol d in the differential dW to indicate that this is not an exact differential. More on this in section 2.4 below. 
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Figure 2.3: The constant volume gas thermometer. The gas is placed in thermal contact with an object of temper- 
ature T. An incompressible fluid of density g is used to measure the pressure difference Ap = p gas — p . 



Constant pressure mode : The height of the reservoir is adjusted so that the height difference h 2 — h 1 is held 
constant. This fixes the pressure p of the gas. The gas volume still varies with temperature T, and we can 
define 

T V 

— = — ' (Z9) 

1 ref ^ref 

where T ref and V Icf are the reference temperature and volume, respectively. 

Constant volume mode : The height of the reservoir is adjusted so that h 1 = 0, hence the volume of the gas 
is held fixed, and the pressure varies with temperature. We then define 

^- = — , (2-10) 

1 ref Prel 

where T Tef and p ref are the reference temperature and pressure, respectively. 



What should we use for a reference? One might think that a pot of boiling water will do, but anyone who has 
gone camping in the mountains knows that water boils at lower temperatures at high altitude (lower pressure). 
This phenomenon is reflected in the phase diagram for H 2 0, depicted in Fig. 2.4. There are two special points in the 
phase diagram, however. One is the triple point, where the solid, liquid, and vapor (gas) phases all coexist. The 
second is the critical point, which is the terminus of the curve separating liquid from gas. At the critical point, the 
latent heat of transition between liquid and gas phases vanishes (more on this later on). The triple point temperature 
T t at thus unique and is by definition T t = 273.16 K. The pressure at the triple point is 611.7 Pa = 6.056 x 10~ 3 atm. 

A question remains: are the two modes of the thermometer compatible? E.g. it we boil water at p = p Q = 1 atm, 
do they yield the same value for T? And what if we use a different gas in our measurements? In fact, all these 
measurements will in general be incompatible, yielding different results for the temperature T. However, in the 
limit that we use a very low density gas, all the results converge. This is because all low density gases behave as 
ideal gases, and obey the ideal gas equation of state pV = Nk B T. 
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temperature T 



Figure 2.4: A sketch of the phase diagram of H2O (water). Two special points are identified: the triple point (T t ,p t ) 
at which there is three phase coexistence, and the critical point (T c ,p c ), where the latent heat of transformation 
from liquid to gas vanishes. Not shown are transitions between several different solid phases. 



2.2.5 Standard temperature and pressure 



It is customary in the physical sciences to define certain standard conditions with respect to which any arbitrary 
conditions may be compared. In thermodynamics, there is a notion of standard temperature and pressure, abbre- 
viated STP. Unfortunately, there are two different definitions of STP currently in use, one from the International 
Union of Pure and Applied Chemistry (IUPAC), and the other from the U.S. National Institute of Standards and 
Technology (NIST). The two standards are: 

IUPAC : T = 0°C = 273. 15 K , p = 10 5 Pa 
NIST : T = 20° C = 293.15 K , p = 1 atm = 1.01325 x 10 5 Pa 



To make matters worse, in the past it was customary to define STP as T Q = 0° C and p = 1 atm. We will use the 
NIST definition in this course. Unless I slip and use the IUPAC definition. Figuring out what I mean by STP will 
keep you on your toes. 

The volume of one mole of ideal gas at STP is then 

JV A fc B r = [22.711 £ (IUPAC) 
p [ 24.219 £ (NIST) , 



where 1 1 = 10 6 cm 3 = 10 -3 m 3 is one liter. Under the old definition of STP as T = 0° C and p = 1 atm, the 
volume of one mole of gas at STP is 22.414 I, which is a figure I remember from my 10 th grade chemistry class 
with Mr. Lawrence. 



24 



CHAPTER 2. THERMODYNAMICS 




(constant pressure) 



O2 (constant volume) 
N 2 (constant volume) 



number density n 

Figure 2.5: As the gas density tends to zero, the readings of the constant volume gas thermometer converge. 

2.3 The Zeroth Law of Thermodynamics 

Equilibrium is established by the exchange of energy, volume, or particle number between different systems or 
subsystems: 



energy exchange 
volume exchange 
particle exchange 
Equilibrium is transitive, so 



T = constant 
p = constant 
/1 = constant 



thermal equilibrium 
mechanical equilibrium 
chemical equilibrium 



If A is in equilibrium with B, and B is in equilibrium with C, then A is in equilibrium with C. 
This known as the Zeroth Law of Thermodynamics 4 . 



2.4 Mathematical Interlude : Exact and Inexact Differentials 



The differential 



dF = J2A i dx i 



(2.12) 



is called exact if there is a function F(x 1: . . . , x k ) whose differential gives the right hand side of eqn. 2.12. In this 
case, we have 



A, 



OF 

dx, 



dAj dAj 
dxj dx i 



(2.13) 



4 As we shall see further below, thermomechanical equilibrium in fact leads to constant p/T, and thermochemical equilibrium to constant 
n/T. If there is thermal equilibrium, then T is already constant, and so thermomechanical and thermochemical equilibria then guarantee the 
constancy of p and ^t. 
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path II 



path I 



x, 



A 

Figure 2.6: Two distinct paths with identical endpoints. 
For exact differentials, the integral between fixed endpoints is path-independent: 



JdF = F{x B 1 ,...,xl)-F(xt 



4) 



(2.14) 



from which it follows that the integral of dF around any closed path must vanish: 

>dF = 0. 



(2.15) 



When the cross derivatives are not identical, i.e. when dA i /dxj ^ dAj/dx it the differential is inexact. In this case, 
the integral of dF is path dependent, and does not depend solely on the endpoints. 



As an example, consider the differential 

dF = K 1 ydx + K 2 x dy . 
Let's evaluate the integral of dF, which is the work done, along each of the two paths in Fig. 2.6: 

X B Vb 

W m = K 1 Jdx y A + K 2 Jdy x B = K 1 y A (x B - x A ) + K 2 x B (y B - y A ) 

X A Va 

x b v B 

W (II> = K 1 Jdxy B + K 2 J dy x A = K 1 y B (x B - x A ) + K 2 x A (y B - y A ) . 



(2.16) 



(2.17) 



(2.18) 



Note that in general W {1) ^ W (n) . Thus, if we start at point A, the kinetic energy at point B will depend on the 
path taken, since the work done is path-dependent. 
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Q 



heat added 
to system 



AE = Q - W 



change in internal energy 



W 

work done 
by system 



Figure 2.7: The first law of thermodynamics is a statement of energy conservation. 



The difference between the work done along the two paths is 

W m - = jdF = (K 2 - K ± ) (x B - x A ) (y B - y A ) . (2.19) 

Thus, we see that if K 1 = K 2 , the work is the same for the two paths. In fact, if K 1 = K 2 , the work would be 
path-independent, and would depend only on the endpoints. This is true for any path, and not just piecewise 
linear paths of the type depicted in Fig. 2.6. Thus, if K x = K 2 , we are justified in using the notation dF for the 
differential in eqn. 2.16; explicitly, we then have F = K x xy. However, if K Y ^ K 2 , the differential is inexact, and 
we will henceforth write dF in such cases. 



2.5 The First Law of Thermodynamics 

2.5.1 Conservation of energy 

The first law is a statement of energy conservation, and is depicted in Fig. 2.7. It says, quite simply, that during 
a thermodynamic process, the change in a system's internal energy E is given by the heat energy Q added to the 
system, minus the work W done by the system: 

AE = Q - W . (2.20) 

The differential form of this, the First Law of Thermodynamics, is 

dE = dQ-dW . (2.21) 

We use the symbol if in the differentials dQ and dW to remind us that these are inexact differentials. The energy 
E, however, is a state function, hence dE is an exact differential. 

Consider a volume V of fluid held in a flask, initially at temperature T , and held at atmospheric pressure. The 
internal energy is then E = E(T Q ,p, V). Now let us contemplate changing the temperature in two different ways. 
The first method (A) is to place the flask on a hot plate until the temperature of the fluid rises to a value T x . The 
second method (B) is to stir the fluid vigorously. In the first case, we add heat Q A > but no work is done, so 
W A = 0. In the second case, if we thermally insulate the flask and use a stirrer of very low thermal conductivity, 
then no heat is added, i.e. Q B = 0. However, the stirrer does work — W B > on the fluid (remember W is the work 
done by the system). If we end up at the same temperature T lr then the final energy is E x = E(T 1: p, V) in both 
cases. We then have 

AE = E 1 -E = Q A = -W B . (2.22) 

It also follows that for any cyclic transformation, where the state variables are the same at the beginning and the 
end, we have 

A£ cyclic = Q - W = Q = W (cyclic). (2.23) 
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2.5.2 Single component systems 



A single component system is specified by three state variables. In many applications, the total number of particles 
N is conserved, so it is useful to take N as one of the state variables. The remaining two can be (T, V) or (T, p) or 
(p, V). The differential form of the first law says 



dE = dQ- aw 

= dQ-pdV + ^dN 



(2.24) 



The quantity fj, is called the chemical potential. Here we shall be interested in the case dN = so the last term will 
not enter into our considerations. We ask: how much heat is required in order to make an infinitesimal change in 
temperature, pressure, or volume? We start by rewriting eqn. 2.24 as 



dQ = dE+pdV -fidN . 
We now must roll up our sleeves and do some work with partial derivatives. 
• (T, V, N) systems : If the state variables are (T, V, N), we write 



Then 



dVJ, 



p 



T.N 



dV 



dE \ 



dN J, 



dN 



T,V 



8E \ 
dNJ, 



T,V 



dN 



(2.25) 



(2.26) 



(2.27) 



(T, p, N) systems : If the state variables are (T,p, N), we write 

'dE 



We also write 



Then 



dQ 



dE 



dT 



dT 



.N 



dT 



.N 



dE 
dT 



p,N 



dV 
df 



-oil dp 



'T.N 



dV\ 
dp). 



dp 



T.N 



p,N 



dT- 



dE\ + /dV\ 
d P Jt.n P \dp / TjA r 



dp 



8E\ fdV\ 
OnL +P {on)t, p -" 



dN 



(2.28) 



(2.29) 



(2.30) 



(p, V, N) systems : If the state variables are (p, V, N), we write 



Then 



dQ 



dE\ 

dp). 



dp 



V.N 



dE 
dV 



p,N 



dV 



dE \ 
dN) 



dN . 



p,V 



dE 
~dN 



P,V 



dN 



(2.31) 



(2.32) 



28 



CHAPTER 2. THERMODYNAMICS 





c p 


c p 

( T la TO 




c p 

( T /mnl TO 


Cp 
IT Itr T<"l 


Air 


29.07 


1.01 


H 2 (25 C) 


75.34 


4.181 


A lllTYlini 1TY1 
^1_L LI 1 1 Lll L LI 1 1 L 


24.2 
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37.47 
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Copper 


24.47 


0.385 


Iron 


25.1 


0.450 


C0 2 


36.94 


0.839 


Lead 


26.4 


0.127 


Diamond 


6.115 


0.509 


Lithium 


24.8 


3.58 


Ethanol 


112 


2.44 


Neon 


20.786 


1.03 


Gold 


25.42 


0.129 


Oxygen 


29.38 


0.918 


Helium 


20.786 


5.193 


Paraffin (wax) 


900 


2.5 


Hydrogen 


28.82 


5.19 


Uranium 


27.7 


0.116 


H 2 (-10° C) 


38.09 


2.05 


Zinc 


25.3 


0.387 



Table 2.1: Specific heat (at 25° C, unless otherwise noted) of some common substances. (Source: Wikipedia.) 



The heat capacity of a body C, is by definition the ratio SQ j dT of the amount of heat absorbed by the body to the 
associated infinitesimal change in temperature dT. The heat capacity will in general be different if the body is 
heated at constant volume or at constant pressure. Setting dV = gives, from eqn. 2.27, 

dQ\ _ fdE\ 



Similarly if we set dp = 0, then eqn. 2.30 yields 



C„.„ H S1 =(111 + p(Z) (234) 



r ' N '\dT) rJ1 -\aT) p ^ t '\aT 



p,N 



Unless explicitly stated as otherwise, we shall assume that N is fixed, and will write C v for C v N and C p for C p N . 

The units of heat capacity are energy divided by temperature, e.g. J/K. The heat capacity is an extensive quantity, 
scaling with the size of the system. If we divide by the number of moles N/N A , we obtain the molar heat capacity, 
sometimes called the molar specific heat: c = Cjv, where v = N/N A is the number of moles of substance. Specific 
heat is also sometimes quoted in units of heat capacity per gram of substance. We shall define 

C c heat capacity per mole 

(2.35) 



mN M mass per mole 

Here m is the mass per particle and M is the mass per mole: M = N A m. 

Suppose we raise the temperature of a body from T = T A to T = T B . How much heat is required? We have 

Q = JdTC(T) , (2.36) 

where C = C v or C — C p depending on whether volume or pressure is held constant. For ideal gases, as we shall 
discuss below, C(T) is constant, and thus 



Q = C(T B - TJ 



(2.37) 
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Figure 2.8: Heat capacity CV for one mole of hydrogen (H2) gas. At the lowest temperatures, only translational 
degrees of freedom are relevant, and / = 3. At around 200 K, two rotational modes are excitable and / = 5. Above 
1000 K, the vibrational excitations begin to contribute. Note the logarithmic temperature scale. (Data from H. W. 
Wooley et at, Jour. Natl. Bureau of Standards, 41, 379 (1948).) 

In metals at very low temperatures one finds C = yT, where 7 is a constant 5 . We then have 



^ B 

Q= [dTC(T) = ± 7 (T£-Tt 



T B = v/T A 2 + 2 7 -iQ. 



(2.38) 
(2.39) 



2.5.3 Ideal gases 



The ideal gas equation of state is pV = Nk B T. In order to invoke the formulae in eqns. 2.27, 2.30, and 2.32, we 
need to know the state function E(T, V, N). A landmark experiment by Joule in the mid-19th century established 
that the energy of a low density gas is independent of its volume 6 . Essentially, a gas at temperature T was allowed 
to freely expand from one volume V to a larger volume V > V, with no added heat Q and no work W done. 
Therefore the energy cannot change. What Joule found was that the temperature also did not change. This means 
that E(T, V, N) = E{T, N) cannot be a function of the volume. 

Since E is extensive, we conclude that 

E(T, V,N) = ve{T) , (2.40) 

where v = N/N A is the number of moles of substance. Note that v is an extensive variable. From eqns. 2.33 and 
2.34, we conclude 

C v {T) = ve'{T) , C p {T) = C v {T) + yR, (2.41) 

where we invoke the ideal gas law to obtain the second of these. Empirically it is found that C v (T) is temperature 
independent over a wide range of T, far enough from boiling point. We can then write C v = v c v , where v = N/N A 
is the number of moles, and where c v is the molar heat capacity. We then have 



R 



(2.42) 



5 In most metals, the difference between C y and C p is negligible. 
6 See the description in E. Fermi, Thermodynamics, pp. 22-23. 
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Figure 2.9: Molar heat capacities cy for three solids. The solid curves correspond to the predictions of the Debye 
model, which we shall discuss later. 



where R = N A k B = 8.31457 J/molK is the gas constant. We denote by 7 = c p /c v the ratio of specific heat at 
constant pressure and at constant volume. 

From the kinetic theory of gases, one can show that 

monatomic gases: c v = |i? , c p = |i? , 7 = | 

diatomic gases: c v = |i£ , c p = |i? , 7 = | 

polyatomic gases: c v = 3R , c p = AR , 7 = | . 



Digression : kinetic theory of gases 

We will conclude in general from noninteracting classical statistical mechanics that the specific heat of a substance 
is c v = \jR, where / is the number of phase space coordinates, per particle, for which there is a quadratic kinetic 
or potential energy function. For example, a point particle has three translational degrees of freedom, and the 
kinetic energy is a quadratic function of their conjugate momenta: H = (p^ +p 2 + p 2 z )/2m. Thus, / = 3. Diatomic 
molecules have two additional rotational degrees of freedom - we don't count rotations about the symmetry axis 
- and their conjugate momenta also appear quadratically in the kinetic energy, leading to / = 5. For polyatomic 
molecules, all three Euler angles and their conjugate momenta are in play, and / = 6. 

The reason that / = 5 for diatomic molecules rather than / = 6 is due to quantum mechanics. While translational 
eigenstates form a continuum, or are quantized in a box with Ak a = 2n/L a being very small, since the dimensions 
L a are macroscopic, angular momentum, and hence rotational kinetic energy, is quantized. For rotations about a 
principal axis with very low moment of inertia /, the corresponding energy scale ti 2 /2I is very large, and a high 
temperature is required in order to thermally populate these states. Thus, degrees of freedom with a quantization 
energy on the order or greater than e are 'frozen out' for temperatures T < e Q /k B . 

In solids, each atom is effectively connected to its neighbors by springs; such a potential arises from quantum 
mechanical and electrostatic consideration of the interacting atoms. Thus, each degree of freedom contributes 
to the potential energy, and its conjugate momentum contributes to the kinetic energy. This results in / = 6. 
Assuming only lattice vibrations, then, the high temperature limit for c v (T) for any solid is predicted to be 3R = 
24.944 J/molK. This is called the Dulong-Petit law. The high temperature limit is reached above the so-called Debye 
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temperature, which is roughly proportional to the melting temperature of the solid. 

In table 2.1, we list c p and c p for some common substances at T = 25° C (unless otherwise noted). Note that 
c p for the monatomic gases He and Ne is to high accuracy given by the value from kinetic theory c p = |i? = 
20.7864 J/molK. For the diatomic gases oxygen (0 2 ) and air (mostly N 2 and 2 ), kinetic theory predicts c p = 
|ii = 29.10, which is close to the measured values. Kinetic theory predicts c p = AR = 33.258 for polyatomic gases; 
the measured values for C0 2 and H 2 are both about 10% higher. 



2.5.4 Adiabatic transformations of ideal gases 

Assuming dN = and E = v e(T), eqn. 2.27 tells us that 

dQ = C v dT +pdV . (2.43) 
Invoking the ideal gas law to write p = vRT/V, and remembering C v = v c v , we have, setting dQ = 0, 

dT R d v 

-7fT + — -rr = . 2.44 

T Cy V 

We can immediately integrate to obtain 

{TV' 1 = constant 
pV 1 = constant (2.45) 
Tip 1 '"! = constant 

where the second two equations are obtained from the first by invoking the ideal gas law. These are all adiabatic 
equations of state. Note the difference between the adiabatic equation of state d(pV) = and the isothermal 
equation of state d(pV) = 0. Equivalently, we can write these three conditions as 

V a Tf = V?Tj , p f V f+2 =p f V f+2 , Tf +2 p- 2 = T{ +2 p^ 2 . (2.46) 

It turns out that air is a rather poor conductor of heat. This suggests the following model for an adiabatic atmosphere. 
The hydrostatic pressure decrease associated with an increase dz in height is dp = — gg dz, where g is the density 
and g the acceleration due to gravity. Assuming the gas is ideal, the density can be written as g = Mp/RT, where 
M is the molar mass. Thus, 

^^^dz. (2.47) 
p RT v ' 

If the height changes are adiabatic, then, from d(T 7 p 1_7 ) = 0, we have 

dT= l_±Tdp = _l_±Ml dz ^ (24g) 
7 p 7 it 



with the solution 



where T = T(0) is the temperature at the earth's surface, and 

A = ^ . (2.50) 

Mg 

WithM = 28.88 g and 7 = \ for air, and assuming T = 293 K, we find A = 8.6 km, and dT/dz = -(I-7- 1 ) T /A = 
— 9.7 K/km. Note that in this model the atmosphere ends at a height z max = 7 A/ (7 — 1) = 30 km. 
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Again invoking the adiabatic equation of state, we can find p(z): 



P^)-f T ^ = ( 1 _l^ll ] (2 . 51) 



Po \ T oJ \ 7 A 

Recall that 

e x = lim (l + ~) . (2.52) 

fe^oo V kJ 

Thus, in the limit 7 — * 1, where k = 7/(7 — 1) — * 00, we have p(z) = p exp(— z/X). Finally, since g cx p/T from the 
ideal gas law, we have 

'M-'l-Izif^. (2.53) 



2.5.5 Adiabatic free expansion 

Consider the situation depicted in Fig. 2.10. A quantity (y moles) of gas in equilibrium at temperature T and 
volume Vi is allowed to expand freely into an evacuated chamber of volume V 2 by the removal of a barrier. 
Clearly no work is done on or by the gas during this process, hence W = 0. If the walls are everywhere insulating, 
so that no heat can pass through them, then Q = as well. The First Law then gives AE = Q — W = 0, and there 
is no change in energy. 

If the gas is ideal, then since E(T, V, N) = Nc v T, then AE = gives AT = 0, and there is no change in tem- 
perature. (If the walls are insulating against the passage of heat, they must also prevent the passage of particles, 
so AN = 0.) There is of course a change in volume: AV = V 2 , hence there is a change in pressure. The initial 
pressure is p = Nk B T/V 1 and the final pressure is p' = Nk B T/ (F 2 + V 2 ). 

If the gas is nonideal, then the temperature will in general change. Suppose, for example, that E(T, V, N) = 
a V x N 1 ~ x T y , where a, x, and y are constants. This form is properly extensive: if V and N double, then E doubles. 
If the volume changes from V to V under an adiabatic free expansion, then we must have, from AE = 0, 




(2.54) 



If x/y > 0, the temperature decreases upon the expansion. If x/y < 0, the temperature increases. Without an 
equation of state, we can't say what happens to the pressure. 

Adiabatic free expansion of a gas is a spontaneous process, arising due to the natural internal dynamics of the system. 
It is also irreversible. If we wish to take the gas back to its original state, we must do work on it to compress it. If 
the gas is ideal, then the initial and final temperatures are identical, so we can place the system in thermal contact 
with a reservoir at temperature T and follow a thermodynamic path along an isotherm. The work done on the gas 
during compression is then 

W = -Nk B T j^r = Nk B T \n(Yi) = Nk B T Infl + (2.55) 

The work done by the gas is W — Jp dV = — W. During the compression, heat energy Q = W < is transferred to 
the gas from the reservoir. Thus, Q = W > is given off by the gas to its environment. 
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v 2 



Figure 2.10: In the adiabatic free expansion of a gas, there is volume expansion with no work or heat exchange 
with the environment: AE = Q = W — 0. 

2.6 Heat Engines and the Second Law of Thermodynamics 

2.6.1 There's no free lunch so quit asking 

A heat engine is a device which takes a thermodynamic system through a repeated cycle which can be represented 
as a succession of equilibrium states: A — > B — > C • • ■ — > A. The net result of such a cyclic process is to convert 
heat into mechanical work, or vice versa. 

For a system in equilibrium at temperature T, there is a thermodynamically large amount of internal energy 
stored in the random internal motion of its constituent particles. Later, when we study statistical mechanics, we 
will see how each 'quadratic' degree of freedom in the Hamiltonian contributes ^k B T to the total internal energy. 
An immense body in equilibrium at temperature T has an enormous heat capacity C, hence extracting a finite 
quantity of heat Q from it results in a temperature change AT = —Q/C which is utterly negligible. Such a body 
is called a heat bath, or thermal reservoir. A perfect engine would, in each cycle, extract an amount of heat Q from the 
bath and convert it into work. Since AE = for a cyclic process, the First Law then gives W — Q. This situation is 
depicted schematically in Fig. 2.11. One could imagine running this process virtually indefinitely, slowly sucking 
energy out of an immense heat bath, converting the random thermal motion of its constituent molecules into 
useful mechanical work. Sadly, this is not possible: 

A transformation whose only final result is to extract heat from a source at fixed temperature and 
transform that heat into work is impossible. 

This is known as the Postulate of Lord Kelvin. It is equivalent to the postulate ofClausius, 

A transformation whose only result is to transfer heat from a body at a given temperature to a body at 
higher temperature is impossible. 



These postulates which have been repeatedly validated by empirical observations, constitute the Second Law of 
Thermodynamics . 
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heat bath „ heat bath 




(a) a perfect engine (b) a perfect waste of time 

Figure 2.11: A perfect engine would extract heat Q from a thermal reservoir at some temperature T and convert it 
into useful mechanical work W. This process is alas impossible, according to the Second Law of thermodynamics. 
The inverse process, where work W is converted into heat Q, is always possible. 



2.6.2 Engines and refrigerators 



While it is not possible to convert heat into work with 100% efficiency, it is possible to transfer heat from one 
thermal reservoir to another one, at lower temperature, and to convert some of that heat into work. This is what 
an engine does. The energy accounting for one cycle of the engine is depicted in the left hand panel of Fig. 2.12. 
An amount of heat Q 2 > is extracted- from the reservoir at temperature T 2 . Since the reservoir is assumed to 
be enormous, its temperature change AT 2 = —Q 2 /C 2 is negligible, and its temperature remains constant - this 
is what it means for an object to be a reservoir. A lesser amount of heat, Q lr with < Q 1 < Q 2 , is deposited 
in a second reservoir at a lower temperature T v Its temperature change AT X = +Q 1 /C\ is also negligible. The 
difference W — Q 2 — Q 1 is extracted as useful work. We define the efficiency, rj, of the engine as the ratio of the 
work done to the heat extracted from the upper reservoir, per cycle: 

W 1 Q 1 (2 56) 

Q 2 Q 2 

This is a natural definition of efficiency, since it will cost us fuel to maintain the temperature of the upper reservoir 
over many cycles of the engine. Thus, the efficiency is proportional to the ratio of the work done to the cost of the 
fuel. 

A refrigerator works according to the same principles, but the process runs in reverse. An amount of heat Qi is 
extracted from the lower reservoir - the inside of our refrigerator - and is pumped into the upper reservoir. As 
Clausius' form of the Second Law asserts, it is impossible for this to be the only result of our cycle. Some amount 
of work W must be performed on the refrigerator in order for it to extract the heat Q 1 . Since AE = for the cycle, 
a heat Q 2 = W + Qi must be deposited into the upper reservoir during each cycle. The analog of efficiency here 
is called the coefficient of refrigeration, k, defined as 

Thus, k is proportional to the ratio of the heat extracted to the cost of electricity, per cycle. 

Please note the deliberate notation here. I am using symbols Q and W to denote the heat supplied to the engine 
(or refrigerator) and the work done by the engine, respectively, and Q and W to denote the heat taken from the 
engine and the work done on the engine. 

A perfect engine has Q 1 = and r\ = 1; a perfect refrigerator has Q x = Q 2 and n = oo. Both violate the Second 
Law. Sadi Carnot 7 (1796 - 1832) realized that a reversible cyclic engine operating between two thermal reservoirs 



7 Camot died during cholera epidemic of 1832. His is one of the 72 names engraved on the Eiffel Tower. 
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(a) engine (b) refrigerator 

Figure 2.12: An engine (left) extracts heat Q2 from a reservoir at temperature T2 and deposits a smaller amount of 
heat Qi into a reservoir at a lower temperature Ti, during each cycle. The difference W = Q2 — Qi is transformed 
into mechanical work. A refrigerator (right) performs the inverse process, drawing heat Qi from a low tempera- 
ture reservoir and depositing heat Q2 = Q\ + W into a high temperature reservoir, where W is the mechanical (or 
electrical) work done per cycle. 



must produce the maximum amount of work W, and that the amount of work produced is independent of the 
material properties of the engine. We call any such engine a Carnot engine. 

The efficiency of a Carnot engine may be used to define a temperature scale. We know from Carnot's observations 
that the efficiency ?; c can only be a function of the temperatures T x and T 2 : rj c = r] c (T 1 ,T 2 ). We can then define 

^1 = 1-^(7^). (2.58) 

Below, in §2.6.4, we will see that how, using an ideal gas as the 'working substance' of the Carnot engine, this 
temperature scale coincides precisely with the ideal gas temperature scale from §2.2.4. 



2.6.3 Nothing beats a Carnot engine 

The Carnot engine is the most efficient engine possible operating between two thermal reservoirs. To see this, 
let's suppose that an amazing wonder engine has an efficiency even greater than that of the Carnot engine. A key 
feature of the Carnot engine is its reversibility - we can just go around its cycle in the opposite direction, creating 
a Carnot refrigerator. Let's use our notional wonder engine to drive a Carnot refrigerator, as depicted in Fig. 2.13. 

We assume that 

W W 

7T = ri moudm > 77 Car „o t = t^t • ( 2 - 59 ) 

But from the figure, we have W = W", and therefore the heat energy Q' 2 — Q 2 transferred to the upper reservoir is 
positive. From 

W = Q 2 - Q 1 = Q' 2 - Q[ = W , (2.60) 

we see that this is equal to the heat energy extracted from the lower reservoir, since no external work is done on 
the system: 

S2 - Q2 = Q'i - Ci > • (2.61) 
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Figure 2.13: A wonder engine driving a Carnot refrigerator. 



Therefore, the existence of the wonder engine entails a violation of the Second Law. Since the Second Law is 
correct - Lord Kelvin articulated it, and who are we to argue with a Lord? - the wonder engine cannot exist. 

We further conclude that all reversible engines running between two thermal reservoirs have the same efficiency, which is 
the efficiency of a Carnot engine. For an irreversible engine, we must have 



W Q, Ti 



T 



Thus, 



Q 1 _Q 1<0 

T 2 T x ~ 



(2.62) 
(2.63) 



2.6.4 The Carnot cycle 

Let us now consider a specific cycle, known as the Carnot cycle, depicted in Fig. 2.14. The cycle consists of two 
adiabats and two isotherms. The work done per cycle is simply the area inside the curve on our p — V diagram: 




(2.64) 



The gas inside our Carnot engine is called the 'working substance'. Whatever it may be, the system obeys the First 
Law, 

dE = JQ - dW = JQ - pdV . (2.65) 

We will now assume that the working material is an ideal gas, and we compute W as well as Q 1 and Q 2 to find 
the efficiency of this cycle. In order to do this, we will rely upon the ideal gas equations, 

E = , pV = vRT , (2.66) 

7-1 

where 7 = c p /c v = 1 + j, where / is the effective number of molecular degrees of freedom contributing to the 
internal energy. Recall / = 3 for monatomic gases, / = 5 for diatomic gases, and / = 6 for polyatomic gases. The 
finite difference form of the first law is 

AE = E ( -E i = Q if - W if , (2.67) 
where i denotes the initial state and f the final state. 
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volume V 

Figure 2.14: The Carnot cycle consists of two adiabats (dark red) and two isotherms (blue). 



AB: This stage is an isothermal expansion at temperature T 2 . It is the 'power stroke' of the engine. We have 



dV ^ = vRT 2 In C£ 



E. — E„ 



vRT 2 



hence 



( y 



BC: This stage is an adiabatic expansion. We have 

Qbc = 



vR 



m - t 2 ) 



(2.68) 
(2.69) 

(2.70) 

(2.71) 
(2.72) 



The energy change is negative, and the heat exchange is zero, so the engine still does some work during this 
stage: 

W BC = Q BC - AE BC = (T 2 -Tj). (2.73) 

7-1 

CD: This stage is an isothermal compression, and we may apply the analysis of the isothermal expansion, mutatis 
mutandis: 



Vn 



vRT, 



hence 



E ° = E » = 7 _ i ' 



(2.74) 
(2.75) 

(2.76) 
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DA: This last stage is an adiabatic compression, and we may draw on the results from the adiabatic expansion in 
BC: 

Q DA = (2.77) 

AE VA = E -E A = ^-(T 2 - r x ) . (2.78) 
7-1 

The energy change is positive, and the heat exchange is zero, so work is done on the engine: 

W DA = Q DA - A£ DA = J^- (Ti - T 2 ) . (2.79) 

7-1 

We now add up all the work values from the individual stages to get for the cycle 

w = w AB + w BC + w co + W VA 

= vRT 2 hx(^f] + vRT 1 \J^ 



Since we are analyzing a cyclic process, we must have AE = 0, we must have Q = W, which can of course be 
verified explicitly, by computing Q = Q AB + Q BC + Q CD + Q DA . To finish up, recall the adiabatic ideal gas equation 
of state, dfW^ 1 ) = 0. This tells us that 



r a vg'- 1 =T 1 v3'- 1 (2.8i) 

T 2 V?- 1 =T 1 V3- 1 . (2.82) 



Dividing these two equations, we find 



and therefore 



(2.83) 



Finally the efficiency is given by the ratio of these two quantities: 



W = vR(T 2 - T x ) In (2.84) 
Qab = vRT 2 In (Q) . (2.85) 



W T-t 

V = 7^ = 1 -^- (2-86) 



2 



2.6.5 The Stirling cycle 

Many other engine cycles are possible. The Stirling cycle, depicted in Fig. 2.15, consists of two isotherms and two 
isochores. Recall the isothermal ideal gas equation of state, d(pV) = 0. Thus, for an ideal gas Stirling cycle, we 
have 

P A Vi=P*V 2 , PdVi = Pc V 2 , (2.87) 

which says 

PjL = Pc = Yl. (2.88) 
Pa Pv y 2 
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volume V 

Figure 2.15: A Stirling cycle consists of two isotherms (blue) and two isochores (green). 



AB: This isothermal expansion is the power stroke. Assuming v moles of ideal gas throughout, we have pV 
uRT 2 =p 1 V v hence 



vRT 2 /Vn 



Since AB is an isotherm, we have E A = E B , and from AE AB = we conclude Q AB = W AB . 
BC: Isochoric cooling. Since dV = we have W BC = 0. The energy change is given by 



E„ — E„ 



7-1 



which is negative. Since W BC = 0, we have Q BC = AE BC . 
CD: Isothermal compression. Clearly 

v, 



W 



Since CD is an isotherm, we have E c = E D , and from AE CD = we conclude Q CD = W CD . 
DA: Isochoric heating. Since dV = we have W DA = 0. The energy change is given by 

vR(T 2 - T x ) 



AS„ A = E . — E„ = 



7-1 



which is positive, and opposite to A_B BC . Since W DA = 0, we have Q DA = AE DA . 



We now add up all the work contributions to obtain 



w = w AB + W BC 



vR(T 2 -TJ hi 



w +w 

ry CD ' DA 



V 



(2.89) 



(2.90) 



(2.91) 



(2.92) 



(2.93) 



40 



CHAPTER 2. THERMODYNAMICS 




volume V 

Figure 2.16: An Otto cycle consists of two adiabats (dark red) and two isochores (green). 



The cycle efficiency is once again 



V = 



W 

o7u 



To 



(2.94) 



2.6.6 The Otto and Diesel cycles 

The Otto cycle is a rough approximation to the physics of a gasoline engine. It consists of two adiabats and two 
isochores, and is depicted in Fig. 2.16. Assuming an ideal gas, along the adiabats we have d{pV 1 ) = 0. Thus, 



PaW=P b V? 



which says 



Pb = Pc 

Pa Pu 



p»v?=p c vj 



(2.95) 
(2.96) 



AB: Adiabatic expansion, the power stroke. The heat transfer is Q AB = 0, so from the First Law we have W J 

-AE AB = E A - E B , thus 

Pa^i ~P b V 2 _ PpYi 



1 TT~ 



7-1 



7 — 1 7 — 1 

Note that this result can also be obtained from the adiabatic equation of state pV 1 = p A V^: 

v, v 2 



(2.97) 



pdV =p A V? / dW' 1 



P A Vi 
7-1 



1 - 



7-1 



(2.98) 



BC: Isochoric cooling (exhaust); dV = hence W BC = 0. The heat Q BC absorbed is then 



QbC - E C - E B 



v 2 

7-1 



(Pc - Pb) 



(2.99) 



In a realistic engine, this is the stage in which the old burned gas is ejected and new gas is inserted. 
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volume V 

Figure 2.17: A Diesel cycle consists of two adiabats (dark red), one isobar (light blue), and one isochore (green). 



CD: Adiabatic compression; Q CD = and W CD — E c — E D : 



7-1 



7-1 



1 - 



7-1 



(2.100) 



DA: Isochoric heating, i.e. the combustion of the gas. As with BC we have dV = 0, and thus W DA = 0. The heat 
Q DA absorbed by the gas is then 



Q DA = E A -E D = -^-(p A -p D ) 
7-1 



(2.101) 



The total work done per cycle is then 



W = W„ 



W +W 

BC ' CD 1 " DA 



(Pa -PuWl 
7-1 



1 - 



7-1 



(2.102) 



and the efficiency is defined to be 



IT' 



= 1-1^ 

V 2 



7-1 



(2.103) 



The ratio V 2 /V 1 is called the compression ratio. We can make our Otto cycle more efficient simply by increasing the 
compression ratio. The problem with this scheme is that if the fuel mixture becomes too hot, it will spontaneously 
'preignite', and the pressure will jump up before point D in the cycle is reached. A Diesel engine avoids preignition 
by compressing the air only, and then later spraying the fuel into the cylinder when the air temperature is sufficient 
for fuel ignition. The rate at which fuel is injected is adjusted so that the ignition process takes place at constant 
pressure. Thus, in a Diesel engine, step DA is an isobar. The compression ratio is r = V B /V D , and the cutoff ratio 
is s = V A /V D . This refinement of the Otto cycle allows for higher compression ratios (of about 20) in practice, and 
greater engine efficiency. 
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For the Diesel cycle, we have, briefly, 



W=P A (V A -V D ) + 



Pa V a-Pb V b , PcVc - Pu V u 



and 



7 — 1 7 — 1 

JPa{V a ~ Vp) _ (P B -Pc)^b 
7 — 1 7 — 1 

1P A (V A -V D ) 



Qr 



7-1 



(2.104) 



(2.105) 



To find the efficiency, we will need to eliminate p B and p c in favor of p A using the adiabatic equation of state 
d(p\n) = 0. Thus, 

V, 



I f- J ( 2 - 106 ) 



where we've used p D = p A and V c — V B . Putting it all together, the efficiency of the Diesel cycle is 

_ W 1 r 1 ^( S T - 1) 

11 Q DA 7 s - 1 



(2.107) 



2.6.7 The Joule-Brayton cycle 

Our final example is the Joule-Brayton cycle, depicted in Fig. 2.18, consisting of two adiabats and two isobars. 
Along the adiabats we have Thus, 



which says 



P2V2=PiV3 



V V 



p 2 v^= Pl vj, 



(2.108) 



(2.109) 



AB: This isobaric expansion at p = p 2 is the power stroke. We have 

JdVp 2 =p 2 (V B -V A ) 

V A 

P2 (Vb - 7 A ) 



W A 



A£ ab = E s - E A 



'AB = A£ AB + VF AB 



7-1 

7-1 



BC: Adiabatic expansion; Q BC = and W B0 = E B — E c . The work done by the gas is 



7-1 7 - 1 V. ^2^1 



P2K 
7-1 



1-1^ 

P 2 



1-7- 



(2.110) 

(2.111) 
(2.112) 



(2.113) 
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Figure 2.18: A Joule-Brayton cycle consists of two adiabats (dark red) and two isobars (light blue). 
CD: Isobaric compression at p = p v 



W 



AE CD = E D - E c 



dV Pl = Pl (V D - V c ) = -p 2 (V B - V A ) 



pi (y D - v c ) 



Qcu = &e cu + W CD 



7-1 
1P 2 



1-7- 1 



(2.114) 

(2.115) 
(2.116) 



BC: Adiabatic expansion; Q DA = and W DA = E D — E A . The work done by the gas is 



W - Pi V °~ p 2 V A P2 V a Pi Ye. 

7-1 \ P 2 'V A 
i_ 7 -' 



7-1 

P 2 V A 
7-1 



1 



(2.117) 



The total work done per cycle is then 



w = w +w +w +w 

yv "ad t "bc t "cd ' "da 



and the efficiency is defined to be 



IP2 (Vb ~ Va 
7-1 



1 - 



1-7- 



Qab \P 2 



1-7- 1 



(2.118) 



(2.119) 
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2.6.8 Carnot engine at maximum power output 

While the Carnot engine described above in §2.6.4 has maximum efficiency, it is practically useless, because the 
isothermal processes must take place infinitely slowly in order for the working material to remain in thermal 
equilibrium with each reservoir. Thus, while the work done per cycle is finite, the cycle period is infinite, and the 
engine power is zero. 

A modification of the ideal Carnot cycle is necessary to create a practical engine. The idea 8 is as follows. During 
the isothermal expansion stage, the working material is maintained at a temperature T 2w < T 2 . The temperature 
difference between the working material and the hot reservoir drives a thermal current, 

^ = k 2 (T 2 -T 2w ). (2.120) 

Here, k 2 is a transport coefficient which describes the thermal conductivity of the chamber walls, multiplied by a 
geometric parameter (which is the ratio of the total wall area to its thickness). Similarly, during the isothermal 
compression, the working material is maintained at a temperature T lw > T lr which drives a thermal current to 
the cold reservoir, 

^=« 1 (T lw -T 1 ). (2.121) 

Now let us assume that the upper isothermal stage requires a duration At 2 and the lower isotherm a duration 
At v Then 

Q 2 = n 2 Ai 2 (T 2 - T 2w ) (2.122) 
Q 1 =K 1 At 1 (T lw -T 1 ). (2.123) 



Since the engine is reversible, we must have 
which says 



Qi_ _ Qjl. 



(2.124) 



(2.125) 



At l = K 2 T 2w ( T lw~ T l) 
At 2 K l T lw ( T 2 - T 2w) ' 

The power is 

P = Q i - g i (2.126) 

(1 + a) (Aij + At 2 ) V 

where we assume that the adiabatic stages require a combined time of a (Ai x + At 2 ). Thus, we find 

~ (1 + a) [«! T 2 (T lw - T x ) + n 2 T x (T 2 - T 2w ) + (k 2 - (T 1w - T x ) (T 2 - T 2w )] 
We optimize the engine by maximizing P with respect to the temperatures T lw and T 2w . This yields 

T 2w = T 2 - f 2 ~ (2.128) 

1 + V K 2/ K l 

T lw =T 1+ ; 1 . (2.129) 

1 + V K l/ K 2 



8 See F. L. Curzon and B. Ahlborn, Am. J. Phys. 43, 22 (1975). 



2.7. THE ENTROPY 



45 



Power source 


T x (°C) 


T 2 (°C) 


^Carnot 


r] (theor.) 


?] (obs.) 


West Thurrock (UK) 
Coal Fired Steam Plant 


- 25 


565 


0.641 


0.40 


0.36 


CANDU (Canada) 
PHW Nuclear Reactor 


- 25 


300 


0.480 


0.28 


0.30 


Larderello (Italy) 
Geothermal Steam Plant 


- 80 


250 


0.323 


0.175 


0.16 



Table 2.2: Observed performances of real heat engines, taken from table 1 from Curzon and Albhorn (1975). 



The efficiency at maximum power is then 



One also finds at maximum power 




(2.130) 



(2.131) 



Finally the maximized power is 




(2.132) 



Table 2.2, taken from the article of Curzon and Albhorn (1975), shows how the efficiency of this practical Carnot 
cycle, given by eqn. 2.130, rather accurately predicts the efficiencies of functioning power plants. 



2.7 The Entropy 
2.7.1 Entropy and heat 

The Second Law guarantees us that an engine operating between two heat baths at temperatures T : and T 2 must 
satisfy 

^ + ^<0, (2.133) 

with the equality holding for reversible processes. This is a restatement of eqn. 2.63, after writing Q x = — Q 1 for 
the heat transferred to the engine from reservoir #1. Consider now an arbitrary curve in the p — V plane. We can 
describe such a curve, to arbitrary accuracy, as a combination of Carnot cycles, as shown in Fig. 2.19. Each little 
Carnot cycle consists of two adiabats and two isotherms. We then conclude 

(2134) 

* 1 c 

with equality holding if all the cycles are reversible. Rudolf Clausius, in 1865, realized that one could then define a 
new state function, which he called the entropy, S, that depended only on the initial and final states of a reversible 
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1_ 

CO 

CO 

CD 
i_ 

Q. 



volume V 

Figure 2.19: An arbitrarily shaped cycle in the p — V plane can be decomposed into a number of smaller Carnot 
cycles. Red curves indicate isotherms and blue curves adiabats, with 7 = f ■ 

process: 

dS=f =► S n -S A = j§. (2.135) 

A 

Since Q is extensive, so is S; the units of entropy are [S] = J/K. 



2.7.2 The Third Law of Thermodynamics 

Eqn. 2.135 determines the entropy up to a constant. By choosing a standard state T, we can define S r = 0, and 
then by taking A = T in the above equation, we can define the absolute entropy S for any state. However, it 
turns out that this seemingly arbitrary constant S T in the entropy does have consequences, for example in the 
theory of gaseous equilibrium. The proper definition of entropy, from the point of view of statistical mechanics, 
will lead us to understand how the zero temperature entropy of a system is related to its quantum mechanical 
ground state degeneracy. Walther Nernst, in 1906, articulated a principle which is sometimes called the Third 
Law of Thermodynamics, 

The entropy of every system at absolute zero temperature always vanishes. 

Again, this is not quite correct, and quantum mechanics tells us that S(T = 0) = k B In g, where g is the ground 
state degeneracy. Nernst's law holds when g = 1. 

We can combine the First and Second laws to write 




dE + dW = tiQ < TdS , 
where the equality holds for reversible processes. 



(2.136) 
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2.7.3 Entropy changes in cyclic processes 

For a cyclic process, whether reversible or not, the change in entropy around a cycle is zero: AS CYC = 0. This 
is because the entropy S is a state function, with a unique value for every equilibrium state. A cyclical process 
returns to the same equilibrium state, hence S must return as well to its corresponding value from the previous 
cycle. 

Consider now a general engine, as in Fig. 2.12. Let us compute the total entropy change in the entire Universe 
over one cycle. We have 

(A5) TOTAL = (AS) ENGINE + (AS) HOT + (A5) COLD , (2.137) 

written as a sum over entropy changes of the engine itself, the hot reservoir, and the cold reservoir 9 . Clearly 
(AS) ENGmE = 0. The changes in the reservoir entropies are 

(AS) SOT = J *2p = J| < o (2.138) 

T=T 2 

(A5) COLD = J = ^ = -9± > , (2.139) 

T=T 1 

because the hot reservoir loses heat Q 2 > to the engine, and the cold reservoir gains heat Q 1 = — Q x > from 
the engine. Therefore, 

(A5) TOTAL = -(^ + ^)>0. (2.140) 

Thus, for a reversible cycle, the net change in the total entropy of the engine plus reservoirs is zero. For an 
irreversible cycle, there is an increase in total entropy, due to spontaneous processes. 



2.7.4 Gibbs-Duhem relation 

Recall eqn. 2.6: 



m = - J2 Vj dX 3 -Y,Va dN a . (2.141) 



For reversible systems, we can therefore write 

dE = T dS + Vj dX j + ^ dN a . (2.142) 

o a 

This says that the energy E is a function of the entropy S, the generalized displacements {Xj}, and the particle 
numbers {N a }: 

E = E(S, {Xj}, {N a }) . (2.143) 

Furthermore, we have 

m ( dE \ ( 9E \ ( dE \ 

Since E and all its arguments are extensive, we have 

XE = E(XS,{XX j },{XN a }) . (2.145) 



9 We neglect any interfacial contributions to the entropy change, which will be small compared with the bulk entropy change in the ther- 
modynamic limit of large system size. 
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We now differentiate the LHS and RHS above with respect to A, setting A = 1 afterward. The result is 

dE 
~dS 



E - S w + L. x inx + 2^ N - 



dX, ^ a 8N a 

J a " 



(2.146) 



Mathematically astute readers will recognize this result as an example of Euler's theorem for homogeneous func- 
tions. Taking the differential of eqn. 2.146, and then subtracting eqn. 2.142, we obtain 



SdT + J2 x j dy j + J2 N - d fa = o 



(2.147) 



This is called the Gibbs-Duhem relation. It says that there is one equation of state which may be written in terms of 
all the intensive quantities alone. For example, for a single component system, we must have p = p(T, /i), which 
follows from 

SdT- Vdp + Ndu = . (2.148) 



2.7.5 Entropy for an ideal gas 

For an ideal gas, we have E = ^fNk B T, and 

dS=±dE + £dV-£dN 

= l 2 fNk B ^ + ^dv+ [\fk B 

Invoking the ideal gas equation of state pV = Nk B T, we have 



T 



(2.149) 



dN . 



dS 



N 2 



\fNk B d\nT + Nk B d\nV . 



(2.150) 



Integrating, we obtain 

S(T,V,N) = \fNk B In T + Nk B \nV + tp(N) , (2.151) 
where <p(N) is an arbitrary function. Extensivity of S places restrictions on <p(N), so that the most general case is 

S(T, V, N) = \fNk B lnT + Nk B In (J^J + Na , 

where a is a constant. Equivalently, we could write 

S(E, V, N) = \fNk B + Nk B In + Nb , 



(2.152) 



(2.153) 



where b = a — \fk B ln(i/fc B ) is another constant. When we study statistical mechanics, we will find that for the 
monatomic ideal gas the entropy is 



S(T,V,N) = Nk B 



V 



(2.154) 



where A T = ^2itK 2 /mk B T is the thermal wavelength, which involved Planck's constant. Let's now contrast two 
illustrative cases. 
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Adiabatic free expansion - Suppose the volume freely expands from to V f = r V ir with r > 1. Such an 
expansion can be effected by a removal of a partition between two chambers that are otherwise thermally 
insulated (see Fig. 2.10). We have already seen how this process entails 

AE = Q = W = . (2.155) 

But the entropy changes! According to eqn. 2.153, we have 

AS = S t -S l = Nk B In r . (2.156) 

Reversible adiabatic expansion - If the gas expands quasistatically and reversibly then S = S(E, V, N) holds 
everywhere along the thermodynamic path. We then have, assuming dN — 0, 



= dS = yNk B ^ + Nk B ^- 



Integrating, we find 



Thus, 



= Nk B d\n(VE f/2 



E /V 2// 



dE sri dV 
h Nk„ 

E B V (2.157) 



E \V 



(2.158) 

E { = r~ 2/f Ei T s = r- 2/f T ; . (2.159) 



2.7.6 Example system 

Consider a model thermodynamic system for which 



where a is a constant. We have 
and therefore 



E(S,V,N) = ^, (2.160) 



dE = TdS -pdV + fidN 7 (2.161) 

(dE\ 3a5 2 

T= {dsl N =NV (2162) 

( dE \ aS3 

P = -Wl N = NV> (Z163) 

' 9E ^ (2.164) 



dN J s v N 2 V ' 

Choosing any two of these equations, we can eliminate S, which is inconvenient for experimental purposes. This 
yields three equations of state, 

T 3 V T 3 N v N 

— =27a- , — = 27a- , f = -- , (2.165) 

p z N fi 2 V n V 

only two of which are independent. 
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What about C v and C p ? To find C v , we recast eqn. 2.162 as 

/ NVT 
V 3a J 



( NVT\ 1/2 

S = — — . (2.166) 



(-) = 






v 1 / 2 N 


T 2 


\dT) v . N 




V 3a , 


1 ~ 18a 


P 



We then have 

Cy='. 

where the last equality on the RHS follows upon invoking the first of the equations of state in eqn. 2.165. To find 
C p , we eliminate V from eqns. 2.162 and 2.163, obtaining T 2 /p = 9aS/N. From this we obtain 

Thus, C p /C v = 4. 

We can derive still more. To find the isothermal compressibility n T = — y {^) T N , use the first of the equations of 
state in eqn. 2.165. To derive the adiabatic compressibility k s = —y{^f-) s N , use eqn. 2.163, and then eliminate 
the inconvenient variable S. 

Suppose we use this system as the working substance for a Carnot engine. Let's compute the work done and 
the engine efficiency. To do this, it is helpful to eliminate S in the expression for the energy, and to rewrite the 
equation of state: 



dW\ T = J^^ /2 ^=2ffiL, (2.171) 



We assume dN = throughout. We now see that for isotherms, 

E 

dT = : —= = constant (2.170) 

w 

Furthermore, since 

lT V 27a VVa ^ 
we conclude that 

dT = : Wif = 2(Ef - E,) , Q if = E { -E { + W i{ = 3(E { - E t ) . (2.172) 
For adiabats, eqn. 2.162 says d(TV) = 0, and therefore 

dQ = : TV = constant , — = constant , EV = constant (2.173) 

as well as W if = E i — E { . We can use these relations to derive the following: 

fv~ T, [~V~ T n 

R = J-2-E. , E c , = -L.S.E. , E n = ^-E A . (2.174) 
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Now we can write 



W AB = 2(E B -E A ) = 2[ J^- 



W BC = (E B - E c ) 



W aD =2(E D -E a ) = 2?±ll-J^ I 



W DA = (E D - E A 



T, 



(2.175) 
(2.176) 
(2.177) 
(2.178) 



Adding up all the work, we obtain 



w = w AB + W BC + W CD + W DA 



(2.179) 



Since 



(2.180) 



we find once again 



W 7\ 

7] = = 1 



(2.181) 



2.7.7 Measuring the entropy of a substance 



If we can measure the heat capacity C V {T) or CJT) of a substance as a function of temperature down to the 
lowest temperatures, then we can measure the entropy. At constant pressure, for example, we have T dS = C p dT, 
hence 



S(p,T) = S(p,T = 0) + JdT'^- 



(2.182) 



The zero temperature entropy is S(p, T = 0) = fc B In 5 where g is the quantum ground state degeneracy at pressure 
p. In all but highly unusual cases, g = 1 and S(p, T = 0) = 0. 



2.8 Thermodynamic Potentials 

Thermodynamic systems may do work on their environments. Under certain constraints, the work done may be 
bounded from above by the change in an appropriately defined thermodynamic potential. 
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2.8.1 Energy E 

Suppose we wish to create a thermodynamic system from scratch. Let's imagine that we create it from scratch in 
a thermally insulated box of volume V. The work we must to to assemble the system is then 

W = E . (2.183) 

After we bring all the constituent particles together, pulling them in from infinity (say), the system will have total 
energy E. After we finish, the system may not be in thermal equilibrium. Spontaneous processes will then occur 
so as to maximize the system's entropy, but the internal energy remains at E. 

We have, from the First Law, dE = dQ — dW . For equilibrium systems, we have 

dE = TdS -pdV + fidN , (2.184) 

which says that E = E(S, V, N), and 



The Second Law, in the form dQ <T dS, then yields 

dE <TdS -pdV + [idN . (2.186) 

This form is valid for single component systems and is easily generalized to multicomponent systems, or magnetic 
systems, etc. Now consider a process at fixed (S, V, N). We then have dE < 0. This says that spontaneous 
processes in a system with dS = dV = dN = always lead to a reduction in the internal energy E. Therefore, 
spontaneous processes drive the internal energy E to a minimum in systems at fixed (S, V, N). 

Allowing for other work processes, we have 

dW <TdS - dE . (2.187) 

Hence, the work done by a thermodynamic system under conditions of constant entropy is bounded above by —dE, and the 
maximum dW is achieved for a reversible process. 

It is useful to define the quantity 

dW bco = dW -pdV , (2.188) 
which is the differential work done by the system other than that required to change its volume. Then 

dW frec <TdS -pdV - dE , (2.189) 

and we conclude that for systems at fixed (S, V) that dW{ Tee < —dE. 



2.8.2 Helmholtz free energy F 

Suppose that when we spontaneously create our system while it is in constant contact with a thermal reservoir 
at temperature T. Then as we create our system, it will absorb heat from the reservoir. Therefore, we don't have 
to supply the full internal energy E, but rather only E — Q, since the system receives heat energy Q from the 
reservoir. In other words, we must perform work 



W = E — TS 



(2.190) 
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to create our system, if it is constantly in equilibrium at temperature T. The quantity E — TS is known as the 
Helmholtz free energy, F, which is related to the energy E by a Legendre transformation, 

F = E-TS . (2.191) 

The general properties of Legendre transformations are discussed in Appendix II, §2.16. 

Under equilibrium conditions, we have 

dF = -SdT -pdV + fidN . (2.192) 
Thus, F = F(T, V, N), whereas E = E(S, V, N), and 

In general, the Second Law tells us that 

dF < -SdT -pdV + pdN . (2.194) 

The equality holds for reversible processes, and the inequality for spontaneous processes. Therefore, spontaneous 
processes drive the Helmholtz free energy F to a minimum in systems at fixed (T, V, N). 

We may also write 

aW <-SdT-dF , (2.195) 

In other words, the work done by a thermodynamic system under conditions of constant temperature is bounded above by 
—dF, and the maximum dW is achieved for a reversible process. We also have 

dW fTec < -SdT -pdV - dF , (2.196) 

and we conclude, for systems at fixed (T, V), that &Wf tee < -dF. 



2.8.3 Enthalpy H 

Suppose that when we spontaneously create our system while it is thermally insulated, but in constant mechanical 
contact with a 'volume bath' at pressure p. For example, we could create our system inside a thermally insulated 
chamber with one movable wall where the external pressure is fixed at p. Thus, when creating the system, in 
addition to the system's internal energy E, we must also perform work pV in order to make room for the it. In 
other words, we must perform work 

W = E + pV. (2.197) 

The quantity E + pV is known as the enthalpy, H. (We use the sans-serif symbol H for enthalpy to avoid confusing 
it with magnetic field, H.) 

The enthalpy is obtained from the energy via a different Legendre transformation: 

H = E + pV . (2.198) 



In equilibrium, then, 

which says H = H(S>, N), with 



dH =TdS + Vdp + pdN , (2.199) 
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In general, we have 

<iH <T dS + Vdp + [xdN , (2.201) 
hence spontaneous processes drive the enthalpy Hto a minimum in systems at fixed (S, p, N). 
For general systems, 

dH < T dS - dW + p dV + Vdp , (2.202) 

hence 

dW bee <TdS + Vdp-dH , (2.203) 
and we conclude, for systems at fixed (S,p), that dWf rco < — dH. 



2.8.4 Gibbs free energy G 

If we create a thermodynamic system at conditions of constant temperature T and constant pressure p, then it 
absorbs heat energy Q = TS from the reservoir and we must expend work energy pV in order to make room for 
it. Thus, the total amount of work we must do in assembling our system is 

W = E - TS + pV . (2.204) 

This is the Gibbs free energy, G. 

The Gibbs free energy is obtained by a second Legendre transformation: 

G = E — TS + pV (2.205) 
Note that G = F + pV = H — TS. For equilibrium systems, the differential of G is 

dG = -SdT + Vdp + /.t dN , (2.206) 

therefore G = G(T, p, N), with 

-(it • -(fL ■ -dt- 



From eqn. 2.146, we have 



therefore 



The Second Law says that 



E = TS - pV + i-iN , (2.208) 
G = fiN . (2.209) 

dG <-SdT + Vdp + ndN , (2.210) 



hence spontaneous processes drive the Gibbs free energy G to a minimum in systems at fixed (T,p,N). For general 
systems, 

dW flcc <~SdT + Vdp - dG . (2.211) 
Accordingly, we conclude, for systems at fixed (T,p), that dWf rcc < -dG. 
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2.8.5 Grand potential Q 

The grand potential, sometimes called the Landau free energy, is defined by 

Q = E — TS - fiN . (2.212) 



Its differential is 
hence 



dQ=-SdT-pdV-Ndfi, (2.213) 



Again invoking eqn. 2.146, we find 

The Second Law tells us 
hence 



Q = - P V . (2.215) 
dQ < -dW - SdT - fidN - Ndfi , (2.216) 



dW boc = dW irco + /i dN < —SdT-pdV — NdpL — dQ . (2.217) 
We conclude, for systems at fixed (T, V, n), that dW{ iee < -dQ. 

2.9 Maxwell Relations 

Maxwell relations are conditions equating certain derivatives of state variables which follow from the exactness 
of the differentials of the various state functions. 

2.9.1 Relations deriving from E(S, V, N) 

The energy E(S, V, N) is a state function, with 

dE = TdS -pdV + fidN , (2.218) 

and therefore 

*-(SL - -(IL • -(SSL- 



Taking the mixed second derivatives, we find 



d 2 E _ /dT\ _(dp\ 

dsov - \dv) SN ~ ~ \ds) ViN 

d 2 E _ /8T\ _ /3/A 
dSdN ~ \dNj sv ~{ds) VN 

d 2 E (dp\ _ ( d/A 

WdN ~ {why ~ [dvL N 



(2.220) 
(2.221) 
(2.222) 
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2.9.2 Relations deriving from F(T, V, N) 

The energy F(T, V, N) is a state function, with 

dF = -SdT -pdV + ^dN , (2.223) 

and therefore 



Taking the mixed second derivatives, we find 



d 2 F _(^L\ -(f*2.\ 

d 2 F / 8S_ \ _ fd^\ 

WdN ~ ~ \W) TV _ \&f) VtN 

d 2 F _ f dp_\ /dfj_\ 

WdN ~ ~ [Wj TV ~ [WJ TN • 



d 2 H _ fdT\ fdV_\ 
dSd'v ~ \dp) StN ~ \dSj pN 

d 2 H /8T\ /fyA 
dSdN~{dNj Sp ~{ds) p 



N 



d 2 H _ (dV\ fdjx 



(2.225) 
(2.226) 
(2.227) 



2.9.3 Relations deriving from H (S, p, N) 

The enthalpy H(S,p, N) satisfies 

d\-\=TdS + Vdp + [xdN , (2.228) 

which says H = H(£,p, N), with 

Taking the mixed second derivatives, we find 



(2.230) 
(2.231) 
(2.232) 



2.9.4 Relations deriving from G(T, p, N) 
The Gibbs free energy G(T 7 p, N) satisfies 

dG = -SdT + Vdp + n dN , (2.233) 

therefore G = G(T,p 7 N), with 

-(it ■ -(it ■ Hm\ r 
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Taking the mixed second derivatives, we find 



d 2 G _ (dS\ _ fdV_" 

dTd~p ~ ~ \dp~) T ,N ~ Ws p ,n 

d 2 G / dS_\ /d/A 

WdN ~ \dNj TiP ~ {df) p 



N 



d 2 G _ (dV\ /d/A 
dp~dN ~ \dN) Tp ~ [dp) TN 



Taking the mixed second derivatives, we find 

d 2 Q 



dS\ = _fdp] 
dTdV \dV) T ^ Wk, 

d 2 n _ (dS\ _ SdN\ 



d 2 Q _ fdp\ _ fdN\ 



>T tfi 



dEdV V dV ) EN V dE /VN 
et cetera. 



(2.235) 
(2.236) 
(2.237) 



2.9.5 Relations deriving from i?(T, V, fj,) 
The grand potential f2(T, V, fi) satisfied 

dQ = -SdT -pdV -Ndfi, (2.238) 

hence 

-H%1 ^ -(il ■ -(ft- 



(2.240) 
(2.241) 
(2.242) 



Relations deriving from S(E, V, N) 

We can also derive Maxwell relations based on the entropy S(E, V, N) itself. For example, we have 

dS = - dE + £ dV - ^dN . (2.243) 

Therefore S = S{E, V, N) and 

d 2 S _fd(T~i)\ _fd(pT~i)\ 
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2.9.6 Generalized thermodynamic potentials 



We have up until now assumed a generalized force-displacement pair (y, X) = (— p, V). But the above results also 
generalize to e.g. magnetic systems, where (y, X) = (H, M). In general, we have 



THIS SPACE AVAILABLE 

F = E - TS 



dE = TdS + ydX + fidN 
dF = —SdT + y dX + fi dN 



(2.245) 
(2.246) 



H = E - yX 



dH = TdS - Xdy + ndN 



(2.247) 



G = E-TS-yX 



Q = E-TS-ixN 



dG 



-SdT-Xdy + fidN 



dQ = -SdT + y dX - Ndfi 



Generalizing (— p, V) — ► (y, X), we also obtain, mutatis mutandis, the following Maxwell relations: 



dXJs,N = 

(-) = 

dS \ 
dXJ TN 



d V Jt. n 
'dS\ 



dy\ 



dX 
~dS 

dy 
df 



y,N 



X,N 



dX \ 
dy\ 



dT 



s,x 



dT \ 

dS \ 
8N) TX 

dS \ 

(-) 

\dnUx 



dfx\ 

dfx\ 
dS Jy,N 

dfj, 
df 



dfx 
df 



X,N 



y,N 



dy \ 

dNj s ^ x 

dX \ 

m h 

dy \ 
8N) TX 

dX \ 

dy\ 
dfih x 



dX 



S,N 



djx\ 

d y)s,N 



\ dx /T 



djx\ 
d y Jt.n 
dN \ 



(2.248) 
(2.249) 



2.10 Equilibrium and Stability 



Suppose we have two systems, A and B, which are free to exchange energy, volume, and particle number, subject 
to overall conservation rules 



£. + E„ 



E 



V, + V n 



V 



X. + N 



N 



(2.250) 



where E, V, and N are fixed. Now let us compute the change in the total entropy of the combined systems when 
they are allowed to exchange energy, volume, or particle number. We assume that the entropy is additive, i.e. 



dS 



dE, 



dS» 



dE a 



V„,N„ 



dN, 



dE A 

dS» 
dN„ 



dS* 
dV A 



A/E A ,N A 



dN. 



as, 

9K 



(2.251) 



Note that we have used dE B = —dE A , dV B = —dV A , and dN B = —dN A . Now we know from the Second Law that 
spontaneous processes result in T dS > 0, which means that S tends to a maximum. If S is a maximum, it must 
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E + AE 


E- 


AE 


V + AV 


V - 


AV 


N + AN 


N - 


AN 



Figure 2.20: To check for an instability, we compare the energy of a system to its total energy when we reapportion 
its energy volume, and particle number slightly unequally. 



be that the coefficients of dE A , dV A , and dN A all vanish, else we could increase the total entropy of the system by 
a judicious choice of these three differentials. From TdS = dE + p dV — fi, dN, we have 

1 fdS\ p fdS\ n fdS 



T [dE) VtN ' T [dv) EiN ' T [dN) EV - (Z252) 

Thus, we conclude that in order for the system to be in equilibrium, so that S is maximized and can increase no 
further under spontaneous processes, we must have 

T A = T B (thermal equilibrium) (2.253) 

p p 

-7- = -7- (mechanical equilibrium) (2.254) 

-'a -'b 

=p = (chemical equilibrium) (2.255) 

A R 



Now consider a uniform system with energy E' = 2E, volume V = 2V, and particle number N' = 2N. We wish 
to check that this system is not unstable with respect to spontaneously becoming inhomogeneous. To that end, 
we imagine dividing the system in half. Each half would have energy E, volume V, and particle number N. But 
suppose we divided up these quantities differently, so that the left half had slightly different energy, volume, and 
particle number than the right, as depicted in Fig. 2.20. Does the entropy increase or decrease? We have 



AS = S(E + AE, V + AV, N + AN) + S(E - AE, V — AV, N - AN) - S(2E, 2V, 2N) 

d 2 S 



(A£) 2 + J£ (A^) 5 



dE 2 



Thus, we can write 



dV 2 



dN 2 



(ANY 



(2.256) 



Q2o a2a a2c 

2 AE AV + 2 - — — AE AN + 2 - — — AN 



dEdV dEdN 
AS 



dVdN 



(2.257) 



where 



Q 



d 2 S 


d 2 S 


d 2 S 


dW 


OEOV 


dE ON 


d 2 S 


d 2 S 


d 2 S 


dEdV 




dV dN 


d 2 S 


d 2 S 


d 2 S 


dE dN 


dV ON 


ON? 



is the matrix of second derivatives, known in mathematical parlance as the Hessian, and \P 
that Q is a symmetric matrix. 



(2.258) 



(AE, AV, AN). Note 



60 



CHAPTER 2. THERMODYNAMICS 



Since S must be a maximum in order for the system to be in equilibrium, we are tempted to conclude that the 
homogeneous system is stable if and only if all three eigenvalues of Q are negative. If one or more of the eigen- 
values is positive, then it is possible to choose a set of variations \P such that AS > 0, which would contradict 
the assumption that the homogeneous state is one of maximum entropy. A matrix with this restriction is said 
to be negative definite. While it is true that Q can have no positive eigenvalues, it is clear from homogeneity of 
S(E, V, N) that one of the three eigenvalues must be zero, corresponding to the eigenvector \P = (E, V, N). Ho- 
mogeneity means S(XE, XV, XN) = XS(E,V,N). Now let us take A = 1 + r), where r\ is infinitesimal. Then 
AE = r]E, AV = rjV, and AiV = r/N, and homogeneity says S(E ± AE, V±AV,N± AN) = (1 ± rj) S(E, V, N) 
and AS = (l+rj)S+(l—r])S—2S = 0. We then have a slightly weaker characterization of Q as negative semidefinite. 

However, if we fix one of the components of ( AE, AV, AN) to be zero, then \P must have some component orthog- 
onal to the zero eigenvector, in which case AS* > 0. Suppose we set AiV = and we just examine the stability with 
respect to inhomogeneities in energy and volume. We then restrict our attention to the upper left 2x2 submatrix 
of Q. A general symmetric 2x2 matrix may be written 



Q = 

It is easy to solve for the eigenvalues of Q. One finds 




(2.259) 



(2.260) 



In order for Q to be negative definite, we require A + < and A_ < 0. Clearly we must have a + c < 0, or else 

< then clearly A_ < 0, but there still is a possibility that A + > 0, if the radical is larger 



A + > for sure. If a 



than — i(a + c). Demanding that A + < therefore yields two conditions: 



c < 



and 



,c > b 2 



(2.261) 



Clearly both a and c must be negative, else one of the above two conditions is violated. So in the end we have 
three conditions which are necessary and sufficient in order that Q be negative definite: 



a < , c < 
Going back to thermodynamic variables, this requires 



ac > b 



d 2 S 
dE 2 



< 



d 2 S 
dV 2 



< 



8 2 S d 2 S 
dE 2 ' dV 2 



> 



d 2 S 
dEdV 



(2.262) 



(2.263) 



Another way to say it: the entropy is a concave function of(E, V) at fixed N. Had we set AE = and considered the 
lower right 2x2 submatrix of Q, we'd have concluded that S(V, N) is concave at fixed E. 

Many thermodynamic systems are held at fixed (T, p, N), which suggests we examine the stability criteria for 
G(T,p,N). Suppose our system is in equilibrium with a reservoir at temperature T and pressure p . Then, 
suppressing TV (which is assumed constant), we have 



G(T ,p ) 



T S- 



P V. 



(2.264) 



Now suppose there is a fluctuation in the entropy and the volume of our system. Going to second order in AS 
and AV, we have 



AG = 



8E\ 
dSJv 



AS- 



d 2 E 
dS 2 



(ASf 



dE\ 

dVJs^ 

, d 2 E 
J dSdV 



■Pa 



AV 



ASAV 



d 2 E 
W 2 



(2.265) 



(AV)' 



+ 
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The condition for equilibrium is that AG > for all (AS, AV). The linear terms vanish by the definition since 
T = T and p = p . Stability then requires that the Hessian matrix Q be positive definite, with 



Thus, we have the following three conditions: 



d 2 E d 2 E 



OS 2 dSdV 

Q = I I • (2.266) 



d 2 E d 2 E 



os* - vSI = k > (Z267) 



d 2 E 
d 2 E 



dV 2 \dV J s Vk s 



> (2.268) 



d 2 E d 2 E _ ( d 2 E V _ T _ fdTX 
dS^'dV 2 ' [dSdV J ~ Vk s C v ~ \dV ) s > ( ' 



2.11 Applications of Thermodynamics 

A discussion of various useful mathematical relations among partial derivatives may be found in the appendix in 
§2.17. Some facility with the differential multivariable calculus is extremely useful in the analysis of thermody- 
namics problems. 



2.11.1 Adiabatic free expansion revisited 



Consider once again the adiabatic free expansion of a gas from initial volume V i to final volume V s = rV v Since 
the system is not in equilibrium during the free expansion process, the initial and final states do not lie along an 
adiabat, i.e. they do not have the same entropy. Rather, as we found, from Q = W = 0, we have that E i = E ( , 
which means they have the same energy, and, in the case of an ideal gas, the same temperature (assuming N is 
constant). Thus, the initial and final states lie along an isotherm. The situation is depicted in Fig. 2.21. Now let us 
compute the change in entropy AS = S f — S i by integrating along this isotherm. Note that the actual dynamics 
are irreversible and do not quasistatically follow any continuous thermodynamic path. However, we can use what 
is a fictitious thermodynamic path as a means of comparing S in the initial and final states. 



We have 

v { 



dS\ 



AS = S f -S i = dV(— . (2.270) 



v, 



dV ), 



T,N 



But from a Maxwell equation deriving from F, we have 



§L - (IL ■ 



hence 



dp\ 



AS= dV h£ ■ (2-272) 



V,N 
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const. 



temperature T 



Figure 2.21: Adiabatic free expansion via a thermal path. The initial and final states do not lie along an adabat! 
Rather, for an ideal gas, the initial and final states lie along an isotherm. 



For an ideal gas, we can use the equation of state pV = Nk B T to obtain 

g] =™». (2.273) 
The integral can now be computed: 

rV. 

f Nk 

AS = dV — p = Nk B In r , (2.274) 

v i 

as we found before, in eqn. 2.156 What is different about this derivation? Previously, we derived the entropy 
change from the explicit formula for S(E, V, N). Here, we did not need to know this function. The Maxwell 
relation allowed us to compute the entropy change using only the equation of state. 



2.11.2 Energy and volume 

We saw how E{T, V, N) — ^fNk B T for an ideal gas, independent of the volume. In general we should have 



E(T,V,N) = N</>(T,%) . 



(2.275) 



For the ideal gas, (j>(T, 



N , 



^fk B T is a function of T alone and is independent on the other intensive quantity 



V/N. How does energy vary with volume? At fixed temperature and particle number, we have, from E = F + TS, 



dE\ 
W), 



T.N 



dF\ 

w), 



T 



T,N 



dS\ 

9V\ =- p + T 



T.N 



dT], 



(2.276) 



V,N 



where we have used the Maxwell relation (||) 

V OV J T.N 



,8T i 



, derived from the mixed second derivative 



dT av 



Another way to derive this result is as follows. Write dE = T dS — p dV + n dN and then express dS in terms of 
dT, dV, and dN, resulting in 



dE = T 



dS\ 
dT) 



dT 



V.N 



T 



dS_\ 



T.N 



dV 



T 



dfx\ 
dT), 



V.N 



dN . 



(2.277) 
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Now read off (ff ) and use the same Maxwell relation as before to recover eqn. 2.276. Applying this result to 
the ideal gas law pV = Nk B T results in the vanishing of the RHS, hence for any substance obeying the ideal gas 
law we must have 

E(T, V,N)=v e(T) = Ne(T) / N A . (2.278) 



2.11.3 van der Waals equation of state 

It is clear that the same conclusion follows for any equation of state of the form p(T, V,N) = T ■ f(V/N), where 
f(V/N) is an arbitrary function of its argument: the ideal gas law remains valid 10 . This is not true, however, for 
the van der Waals equation of state, 

P+ ^)( v - b ) = RT > ( 2 - 279 ) 
where v = N A V/N is the molar volume. We then find (always assuming constant N), 

where E(T, V, N) = vs(T, v). We can integrate this to obtain 

e{T,v)=u>(T)--, (2.281) 
v 

where w(T) is arbitrary. From eqn. 2.33, we immediately have 

Cv = (If) =lu ' {t) - {2282) 

What about c p ? This requires a bit of work. We start with eqn. 2.34, 

de \ ( dv \ 



i' 



We next take the differential of the equation of state (at constant N): 



RdT ~ [ p +^)dv+(v 



~ b ) ( d P - ^ dvj 



P-^2+^r) dv +( v - b ) d P 



We can now read off the result for the volume expansion coefficient, 



p v\dT 



(2.283) 



(2.284) 



1 f dv\ 1 R 
«p = r(^J =-■ _ a+ 2« b - 



10 Note V/N = v/N A . 
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We now have for c , 



r = J(T) + iP+ ^ R 
-p w ; T a | 2afc 

^ ^ (2.286) 



where i; = VN A /N is the molar volume. 

To fix ui(T), we consider the « — > oo limit, where the density of the gas vanishes. In this limit, the gas must be 
ideal, hence eqn. 2.281 says that uj(T) = ^fRT. Therefore c v (T,v) — \fR, just as in the case of an ideal gas. 
However, rather than c p = c v + R, which holds for ideal gases, c p (T, v) is given by eqn. 2.286. Thus, 

c vdw = ij R (2 287) 

^-y^ RT /! T 2 :l_ b? - (2-288) 
Note that c p (a — > 0) = c v + R, which is the ideal gas result. 

2.11.4 Thermodynamic response functions 

Consider the entropy S expressed as a function of T, V, and N: 

dS=( d 4) dT + ( d 4) dV + (^) dN. (2.289) 



9TJ VN \dVj TN \dNj. 



>T,V 



Dividing by dT, multiplying by T, and assuming dN = throughout, we have 



This allows us to write 



We define the response functions, 



(2.291) 



C p -C v = -T -i- — . (2.292) 



dVj T \dTj v 

Appealing to a Maxwell relation derived from F(T, V, N), and then appealing to eqn. 2.492, we have 

- (Oil] - f—\ 

dv) T ~{df)y- \dV) T \&f) p 



dp\ fdV\ 2 



1 / dV\ 1 d 2 G 

isothermal compressibility: n T = — — J = — — (2.293) 

V \op J T V op* 

i- i ■ -i .,. 1 fdV\ 1 d 2 H ,„„„,. 

adiabatic compressibility: k, = — — = — — ■ (2.294) 

V \ op J s V ap z 

1 [dV\ 



thermal expansivity: a p = — J ■ (2.295) 
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Thus, 

w Ta 



C< - C v = V ^ , (2.296) 



or, in terms of intensive quantities, 

v Tat 

c p -c v = — ^ , (2.297) 



where, as always, v = VN A /N is the molar volume. 

This above relation generalizes to any conjugate force-displacement pair (—p, V) — ► (y, X): 



C y — C x — T 



T 



dy\ fdx\ 

8T x \dT 

v (2.298) 

dy\ fdX\ 2 



dX ) T \ dT J, 



For example, we could have (y, X) = (H a ,M a ). 



A similar relationship can be derived between the compressibilities k t and k s . We then clearly must start with 
the volume, writing 

Dividing by dp, multiplying by — V 1 , and keeping N constant, we have 



S ~ V\8s) p \dp) T - 
Again we appeal to a Maxwell relation, writing 

dS\ _ /dV\ 
dp-) T " " {dT I ' 

and after invoking the chain rule, 

'dV\ fdV\ fdT\ T fdV\ 



OS J p \dTj p \dSj p C p \dTj p 



(2.300) 



(2.301) 



(2.302) 



we obtain 

vTal 

k t — k s — . (2.303) 

Cp 

Comparing eqns. 2.297 and 2.303, we find 

(Cp - c v ) k t = (k t - k s ) c p = v Ta 2 p . (2.304) 

This result entails 



(2.305) 



Cy Kg 

The corresponding result for magnetic systems is 



(dm\ 
dr) H ' (2 ' 306) 
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1 




~dH) T ~ 


V 


dH 2 


dM\ 


1 


d 2 H 


dWJ s - 


V 


dH 2 



where m = Mj v is the magnetization per mole of substance, and 

(2.307) 
(2.308) 

y un j g v un - 

Here the enthalpy and Gibbs free energy are 

H = E - HM dH = TdS — M dH (2.309) 

G = E-TS- HM dG = —SdT — M dH . (2.310) 

Remark: The previous discussion has assumed an isotropic magnetic system where M and H are collinear, hence 
H M = HM. 

a p _ ( dM a \ 1 dXi 



X ^ -{dHp) s ~ v dH- 3HP • (2 ' 312) 

In this case, the enthalpy and Gibbs free energy are 

H = E — HM dH = T dS — M ■ dH (2.313) 

G = E -TS - H M dG = -SdT - M dH . (2.314) 



fdM a \ 



1 d 2 H 



2.11.5 Joule effect: free expansion of a gas 



Previously we considered the adiabatic free expansion of an ideal gas. We found that Q = W = hence AE = 0, 
which means the process is isothermal, since E = vs(T) is volume-independent. The entropy changes, however, 
since S(E, V, N) = Nk B Ln(V/iV) + ±/iVfc B ]n(E/N) + Ns . Thus, 



(2.315) 



What happens if the gas is nonideal? 



We integrate along a fictitious thermodynamic path connecting initial and final states, where dE = along the 
path. We have 



= dE 



8E\ 



dV 



dE\ 



dT 



hence 



We also have 



dT\ 
dVJ, 



(dE/dV) T 
(dE/dT) v 



J_ fdE\ 
C~ [dVjj 



d 4) =t(^) -p = t(£) -p 



dVL 



dV J, 



9TL 



Thus, 



0T\ 



p-T 



(2.316) 
(2.317) 
(2.318) 

(2.319) 
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gas 


a 

V mol- / 


b (A) 

V mol / 


p c (bar) 


T c (K) 


v c (L/mol) 


Acetone 


14.09 


0.0994 


52.82 


505.1 


0.2982 


Argon 


1.363 


0.03219 


48.72 


150.9 


0.0966 


Carbon dioxide 


3.640 


0.04267 


7404 


304.0 


0.1280 


hthanol 


12.18 


0.08407 


63.83 


516.3 


0.2522 


Freon 


10.78 


0.0998 


40.09 


384.9 


0.2994 


Helium 


0.03457 


0.0237 


2.279 


5.198 


0.0711 


Hydrogen 


0.2476 


0.02661 


12.95 


33.16 


0.0798 


Mercury 


8.200 


0.01696 


1055 


1723 


0.0509 


Methane 


2.283 


0.04278 


46.20 


190.2 


0.1283 


Nitrogen 


1.408 


0.03913 


34.06 


128.2 


0.1174 


Oxygen 


1.378 


0.03183 


50.37 


154.3 


0.0955 


Water 


5.536 


0.03049 


220.6 


647.0 


0.0915 



Table 2.3: Van der Waals parameters for some common gases. (Source: Wikipedia.) 



Note that the term in square brackets vanishes for any system obeying the ideal gas law. For a nonideal gas, 



AT = dV[ — , (2.320) 



v. 



dVJ 



E 



which is in general nonzero. 

Now consider a van der Waals gas, for which 



p + 4 ) ( v - h ) = RT 



r r (dp\ a av 2 

P- T \l^) =-73 = -T?I- ( 2 - 321 ) 



We then have 

P ~ T (dTj v ~ v 2 ~ V 2 
In §2.11.3 we concluded that C v = \ fvR for the van der Waals gas, hence 

v { 

2au fdV 2a ( 1 1 \ 
Thus, if V f > Vj, we have T { < TJ and the gas cools upon expansion. 

Consider 2 gas with an initial specific volume of v i = 22.4 L/mol, which is the STP value for an ideal gas, freely 
expanding to a volume v { = oo for maximum cooling. According to table 2.3, a = 1.378 L 2 • bar/mol 2 , and we 
have AT = —2a/fRv i = — 0.296 K, which is a pitifully small amount of cooling. Adiabatic free expansion is a 
very inefficient way to cool a gas. 
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Figure 2.22: In a throttle, a gas is pushed through a porous plug separating regions of different pressure. The 
change in energy is the work done, hence enthalpy is conserved during the throttling process. 



2.11.6 Throttling: the Joule-Thompson effect 

In a throttle, depicted in Fig. 2.22, a gas is forced through a porous plug which separates regions of different 
pressures. According to the figure, the work done on a given element of gas is 



W= dV P{ - dVp^pM-p-M 



(2.323) 



Now we assume that the system is thermally isolated so that the gas exchanges no heat with its environment, nor 
with the plug. Then Q = so AE = —W, and 



Ei+PiV^Ef + Pf V i 
Hi = H f , 



(2.324) 
(2.325) 



where H is enthalpy. Thus, the throttling process is isenthalpic. We can therefore study it by defining a fictitious 
thermodynamic path along which dH = 0. The, choosing T and p as state variables, 



= dH 



dH 

dT 



dp 



<9H\ 

dp J j 



dT 



hence 



'dT\ (dH/dp) T 

dp) H ~ ~m/dT) p ■ 

The numerator on the RHS is computed by writing dH = T dS + V dp and then dividing by dp, to obtain 



8H\ 

dpi 



dp 



1 dT 



(2.326) 



(2.327) 



(2.328) 



The denominator is 



9H 

dT 



dH 



dS 



T 



dSj p \dT 
dS 



dT 



(2.329) 



Thus, 



dT\ _L_ 

dp ) H c p 

v 



, dv 
T — 



dT 
(Ta p - 1 



(2.330) 
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where a p = y is the volume expansion coefficient. 
From the van der Waals equation of state, we obtain, from eqn. 2.285, 

T / dv\ RT/v _ v-b 



To_ = - — = '—^-r = ~ . (2.331) 



Assuming v » , b, we have 



(2.332) 

op J H c p\ RT J 

Thus, for T > T* = j^, we have [^-) < and the gas heats up upon an isenthalpic pressure decrease. For 



Rb' ,v> - iL " v ^- ^ Op J H 

T <T* , the gas cools under such conditions. 

In fact, there are two inversion temperatures T* 2 for the van der Waals gas. To see this, we set Ta p = 1, which is 
the criterion for inversion. From eqn. 2.331 it is easy to derive 



We insert this into the van der Waals equation of state to derive a relationship T = T* (p) at which Ta p = 1 holds. 
After a little work, we find 

3RT ISaRT a 
P=-^ b - + \j—-p- < 2 - 334 ) 

This is a quadratic equation for T, the solution of which is 
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In Fig. 2.23 we plot pressure versus temperature in scaled units, showing the curve along which (jlffj = 0- The 
volume, pressure, and temperature scales defined are 

v c = 3b , Pc = ^ , T c = ^. (2-336) 

Values for p c , T c , and v c are provided in table 2.3. If we define v = v /v c , p = p/p c , and T = T/T c , then the van der 
Waals equation of state may be written in dimensionless form: 



(2.337) 



In terms of the scaled parameters, the equation for the inversion curve (7^7) =0 becomes 



2 



p = 9-36(l- JiT ) <f=> T = 3(l±Jl-ip . (2.338) 



Thus, there is no inversion for p > 9p c . We are usually interested in the upper inversion temperature, T 2 *, 
corresponding to the upper sign in eqn. 2.335. The maximum inversion temperature occurs for p = 0, where 

T max = ff = T T c- For H 2' from the data m table 2 -3, we find T £ax( H 2) = 224 K, which is within 10% of the 
experimentally measured value of 205 K. 

What happens when H 2 gas leaks from a container with T > T 2 ? Since f^J < anc i ^p < 0, we have AT > 0. 

The gas warms up, and the heat facilitates the reaction 2 H 2 + 2 — ► 2 H 2 0, which releases energy, and we have 
a nice explosion. 



2.12 Phase Transitions and Phase Equilibria 

A typical phase diagram of a p-V-T system is shown in the Fig. 2.24(a). The solid lines delineate boundaries 
between distinct thermodynamic phases. These lines are called coexistence curves. Along these curves, we can 
have coexistence of two phases, and the thermodynamic potentials are singular. The order of the singularity is 
often taken as a classification of the phase transition. I.e. if the thermodynamic potentials E, F, G, and H have 
discontinuous or divergent m th derivatives, the transition between the respective phases is said to be m th order. 
Modern theories of phase transitions generally only recognize two possibilities: first order transitions, where the 
order parameter changes discontinuously through the transition, and second order transitions, where the order param- 
eter vanishes continuously at the boundary from ordered to disordered phases 11 . We'll discuss order parameters 
during Physics 140B. 

For a more interesting phase diagram, see Fig. 2.24(b,c), which displays the phase diagrams for 3 He and 4 He. The 
only difference between these two atoms is that the former has one fewer neutron: (2p + In + 2e) in 3 He versus (2p 
+ 2n + 2e) in 4 He. As we shall learn when we study quantum statistics, this extra neutron makes all the difference, 
because 3 He is a fermion while 4 He is a boson. 



2.12.1 p-v-T surfaces 

The equation of state for a single component system may be written as 

f(p,v,T) = 0. (2.339) 
11 Some exotic phase transitions in quantum matter, which do not quite fit the usual classification schemes, have recently been proposed. 
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Figure 2.24: (a) Typical thermodynamic phase diagram of a single component p-V-T system, showing triple point 
(three phase coexistence) and critical point. (Source: Univ. of Helsinki.) Also shown: phase diagrams for 3 He (b) 
and 4 He (c). What a difference a neutron makes! (Source: Brittanica.) 



This may in principle be inverted to yield p = p(v, T) or v = v(T,p) or T = T(p, v). The single constraint f(p, v, T) 
on the three state variables defines a surface in {p, v, T} space. An example of such a surface is shown in Fig. 2.25, 
for the ideal gas. 

Real p-v-T surfaces are much richer than that for the ideal gas, because real systems undergo phase transitions in 
which thermodynamic properties are singular or discontinuous along certain curves on the p-v-T surface. An 
example is shown in Fig. 2.26. The high temperature isotherms resemble those of the ideal gas, but as one cools 
below the critical temperature T c , the isotherms become singular. Precisely at T = T c , the isotherm p = p(v, T c ) 
becomes perfectly horizontal at v = v c , which is the critical molar volume. This means that the isothermal com- 
pressibility, k t = — ^(§^) T diverges at T = T c . Below T c , the isotherms have a flat portion, as shown in Fig. 2.28, 
corresponding to a two-phase region where liquid and vapor coexist. In the (p, T) plane, sketched for H 2 in Fig. 2.4 
and shown for C0 2 in Fig. 2.29, this liquid-vapor phase coexistence occurs along a curve, called the vaporization 
(or boiling) curve. The density changes discontinuously across this curve; for H 2 0, the liquid is approximately 
1000 times denser than the vapor at atmospheric pressure. The density discontinuity vanishes at the critical point. 
Note that one can continuously transform between liquid and vapor phases, without encountering any phase 
transitions, by going around the critical point and avoiding the two-phase region. 

In addition to liquid-vapor coexistence, solid-liquid and solid-vapor coexistence also occur, as shown in Fig. 2.26. 
The triple point (T t ,p t ) lies at the confluence of these three coexistence regions. For H 2 0, the location of the triple 
point and critical point are given by 

T t = 273.16K T C = 647K 

p t = 611.7Pa= 6.037 x 10" 3 atm Pc = 22.06 MPa = 217.7 atm 



2.12.2 The Clausius-Clapeyron relation 

Recall that the homogeneity of E(S,V,N) guaranteed E = TS— pV + ^iN, from Euler's theorem. It also guarantees 
a relation between the intensive variables T, p, and /i, according to eqn. 2.148. Let us define g = G/v = N A fi, the 
Gibbs free energy per mole. Then 

dg = -sdT + vdp , (2.340) 
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Figure 2.25: The surface p(v, T) = RT/v corresponding to the ideal gas equation of state, and its projections onto 
the (p, T), (p, v), and (T, v) planes. 



where s = S/v and v = Vjv are the molar entropy and molar volume, respectively. Along a coexistence curve 
between phase #1 and phase #2, we must have g 1 = g 2 , since the phases are free to exchange energy and particle 
number, i.e. they are in thermal and chemical equilibrium. This means 

dg 1 = —s 1 dT + v 1 dp = — s 2 dT + v 2 dp = dg 2 ■ (2.341) 

Therefore, along the coexistence curve we must have 

dp \ s ? ^ S 1 & ,~ ~ , m 

(2.342) 



v dT ) „„„„ v — Vi T Av ' 
where 

£ = T As = T{s 2 - Sl ) (2.343) 

is the molar latent heat of transition. A heat I must be supplied in order to change from phase #1 to phase #2, even 
without changing p or T. If I is the latent heat per mole, then we write I as the latent heat per gram: I = l/M , 
where M is the molar mass. 

Along the liquid-gas coexistence curve, we typically have w gas » % qu id' an d assuming the vapor is ideal, we may 
write Av sa v gas « RT/p. Thus, 

dp\ £ p£ 

(2.344) 



dTJ^-vs TAv i?T2 



If £ remains constant throughout a section of the liquid-gas coexistence curve, we may integrate the above equation 
to get 

~ = => p(T)=p(T )e^e-^ T . (2.345) 

p R l z 
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Figure 2.26: A p-v-T surface for a substance which contracts upon freezing. The red dot is the critical point and the 
red dashed line is the critical isotherm. The yellow dot is the triple point at which there is three phase coexistence of 
solid, liquid, and vapor. 



2.12.3 Liquid-solid line in H 2 

Life on planet earth owes much of its existence to a peculiar property of water: the solid is less dense than the 
liquid along the coexistence curve. For example at T = 273.1 K and p = 1 atm, 

£ watcr = 1.00013 cm 3 /g , v icc = 1.0907 cm 3 /g ■ (2.346) 

The latent heat of the transition is I = 333 J/g = 79.5 cal/g. Thus, 

dp\ £ 333 J/g 

d?J nq-sd ~~ TA£ ~~ (273.1 K) (-9.05 x H)- 2 c m 3/g) {2M?) 

a dyn atm 
= -1.35 x 10 8 ^— = -134— . 

cm 2 K °C 

The negative slope of the melting curve is invoked to explain the movement of glaciers: as glaciers slide down 
a rocky slope, they generate enormous pressure at obstacles 12 Due to this pressure, the story goes, the melting 
temperature decreases, and the glacier melts around the obstacle, so it can flow past it, after which it refreezes. 
But it is not the case that the bottom of the glacier melts under the pressure, for consider a glacier of height 
h = 1 km. The pressure at the bottom is p ~ gh/v ~ 10 7 Pa, which is only about 100 atmospheres. Such a pressure 
can produce only a small shift in the melting temperature of about AT melt = —0.75° C. 

Does the Clausius-Clapeyron relation explain how we can skate on ice? My seven year old daughter has a mass 
of about M = 20 kg. Her ice skates have blades of width about 5 mm and length about 10 cm. Thus, even on one 



12 The melting curve has a negative slope at relatively low pressures, where the solid has the so-called Hi hexagonal crystal structure. At 
pressures above about 2500 atmospheres, the crystal structure changes, and the slope of the melting curve becomes positive. 
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Figure 2.27: Equation of state for a substance which expands upon freezing, projected to the (v, T) and (v,p) and 
(T, p) planes. 



foot, she only imparts an additional pressure of 



Mg ^ 20 kg x 9.8 m/s 2 
~A~ ~ (5 x 10- 3 m) x (lO-im) 



The change in the melting temperature is thus minuscule: AT n 



3.9 x 10 5 Pa = 3.9 atm . 



-0.03° C. 



(2.348) 



So why can my daughter skate so nicely? The answer isn't so clear! 13 There seem to be two relevant issues in play. 
First, friction generates heat which can locally melt the surface of the ice. Second, the surface of ice, and of many 
solids, is naturally slippery. Indeed, this is the case for ice even if one is standing still, generating no frictional 
forces. Why is this so? It turns out that the Gibbs free energy of the ice-air interface is larger than the sum of free 
energies of ice-water and water-air interfaces. That is to say, ice, as well as many simple solids, prefers to have a 
thin layer of liquid on its surface, even at temperatures well below its bulk melting point. If the intermolecular 
interactions are not short-ranged 14 , theory predicts a surface melt thickness d oc (T m — T) -1 / 3 . In Fig. 2.30 we 
show measurements by Gilpin (1980) of the surface melt on ice, down to about —50° C. Near 0° C the melt layer 
thickness is about 40 nm, but this decreases to ~ lnm at T = — 35° C. At very low temperatures, skates stick 
rather than glide. Of course, the skate material is also important, since that will affect the energetics of the second 
interface. The 19th century novel, Hans Blinker, or The Silver Skates by Mary Mapes Dodge tells the story of the 
poor but stereotypically decent and hardworking Dutch boy Hans Brinker, who dreams of winning an upcoming 
ice skating race, along with the top prize: a pair of silver skates. All he has are some lousy wooden skates, which 
won't do him any good in the race. He has money saved to buy steel skates, but of course his father desperately 
needs an operation because - I am not making this up - he fell off a dike and lost his mind. The family has no 
other way to pay for the doctor. What a story! At this point, I imagine the suspense must be too much for you to 
bear, but this isn't an American Literature class, so you can use Google to find out what happens (or rent the 1958 



13 For a recent discussion, see R. Rosenberg, Physics Today 58, 50 (2005). 

14 For example, they could be of the van der Waals form, due to virtual dipole fluctuations, with an attractive 1/r 6 tail. 
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Figure 2.28: Projection of the p-v-T surface of Fig. 2.26 onto the (v,p) plane. 

movie, directed by Sidney Lumet). My point here is that Hans' crappy wooden skates can't compare to the metal 
ones, even though the surface melt between the ice and the air is the same. The skate blade material also makes a 
difference, both for the interface energy and, perhaps more importantly, for the generation of friction as well. 



2.12.4 Slow melting of ice : a quasistatic but irreversible process 

Suppose we have an ice cube initially at temperature T < = 273.15 K (i.e. = 0° C) and we toss it into a pond 
of water. We regard the pond as a heat bath at some temperature T x > 0. Let the mass of the ice be M. How much 
heat Q is absorbed by the ice in order to raise its temperature to T t l Clearly 

Q = Mc s (S - T ) + Ml+ Mc^fc - 0) , (2.349) 

where c s and c L are the specific heats of ice (solid) and water (liquid), respectively 15 , and £ is the latent heat 
of melting per unit mass. The pond must give up this much heat to the ice, hence the entropy of the pond, 
discounting the new water which will come from the melted ice, must decrease: 

A5 pond = -9. . (2.350) 
Now we ask what is the entropy change of the H 2 in the ice. We have 

- T l 

AS icc = / -^r = dT^ + — + dT- 



T / TGI T 

e (2.351) 

, <9\ Ml (T 

^t ) + W + Mc Ho 



15 We assume c s (T) and c L (T) have no appreciable temperature dependence, and we regard them both as constants. 
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Figure 2.29: Phase diagram for CO2 in the (p, T) plane. (Source: www.scifun.org.) 



The total entropy change of the system is then 

AStotal = AS pond + AS ice 



Mc c In! - Me" (^] + Ml Mc, in(^) - Mc, ^ " ^ (Z352) 



Jo J V T i J \® T iJ L \° J v T i 

Now since T < < T lt we have 

M£ s ( £=3l) < Mc s ( ^-^) . (2.353) 



T x ) S V 

Therefore, 

AS > Ml (A-jr) + M ~° s / ( T "/ ) + Md - f( e / T i) > ( 2 - 354 ) 

where f(x) = x — 1 — In x. Clearly /' (x) = 1 — a; - 1 is negative on the interval (0,1), which means that the maximum 
of f(x) occurs at x = and the minimum at x = 1. But /(0) = 00 and /(l) = 0, which means that f(x) > for 
x e [0, 1]. Since T < < T x , we conclude A5 total > 0. 



2.12.5 Gibbs phase rule 

Equilibrium between two phases means that p, T, and /x(p, T) are identical. From 

/i 1 (p,T)= M2 (p,T), (2.355) 

we derive an equation for the slope of the coexistence curve, the Clausius-Clapeyron relation. Note that we have 
one equation in two unknowns (T, p), so the solution set is a curve. For three phase coexistence, we have 

Ml (p, T) = /i 2 (p, T) = (i 3 (p, T) , (2.356) 

which gives us two equations in two unknowns. The solution is then a point (or a set of points). A critical point 
also is a solution of two simultaneous equations: 

critical point =^ v x (p,T) = v 2 (p,T) , ^(p.T) = fx 2 (p,T) . (2.357) 
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Figure 2.30: Left panel: data from R. R. Gilpin, /. Colloid Interface Sci. 77, 435 (1980) showing measured thickness 
of the surface melt on ice at temperatures below 0°C. The straight line has slope — -|, as predicted by theory. Right 
panel: phase diagram of H^O, showing various high pressure solid phases. (Source : Physics Today, December 
2005). 

Recall v = N A (^) T . Note that there can be no four phase coexistence for a simple p-V-T system. 

Now for the general result. Suppose we have a species, with particle numbers N a , where a = 1, . . . , a. It is 
useful to briefly recapitulate the derivation of the Gibbs-Duhem relation. The energy E(S, V, N-y, . . . , N^) is a 
homogeneous function of degree one: 

E(XS, XV, XN X ,..., XN a ) = XE(S, V, AT ls . . . ,N a ) . (2.358) 

From Euler's theorem for homogeneous functions (just differentiate with respect to A and then set A = 1), we have 

E = TS-pV + Y / V a N a - (2-359) 

a=l 

Taking the differential, and invoking the First Law, 

a 

dE = TdS -pdV + J2^a dN a . (2.360) 

we arrive at the relation 



a=l 



SdT- Vdp + J2 N a = 



(2.361) 



of which eqn. 2.147 is a generalization to additional internal 'work' variables. This says that the a + 2 quantities 
(T,p, [j,!, . . . , n a ) are not all independent. We can therefore write 



(2.362) 



If there are if different phases, then in each phase j, with j = l,...,<p, there is a chemical potential ^ for each 
species a. We then have 

^(T.p,^,...,^). (2.363) 
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Here p!i is the chemical potential of the a tb species in the j th phase. Thus, there are tp such equations relating the 
Z + tp a variables (T,p, {^a ] }) , meaning that only 2 + tp (a — 1) of them may be chosen as independent. This, then, 
is the dimension of 'thermodynamic space' containing a maximal number of intensive variables: 

d TD {a 7 ip) = 2 + Lp {a- 1) . (2.364) 

To completely specify the state of our system, we of course introduce a single extensive variable, such as the total 
volume V. Note that the total particle number N = J2a=i N a may not be conserved in the presence of chemical 
reactions! 

Now suppose we have equilibrium among ip phases. We have implicitly assumed thermal and mechanical equi- 
librium among all the phases, meaning that p and T are constant. Chemical equilibrium applies on a species-by- 
species basis. This means 

M « = (2.365) 

where j,f € {1, . . . , tp}. This gives <r(ip — 1) independent equations equations 16 . Thus, we can have phase equilib- 
rium among the tp phases of a species over a region of dimension 

d PE (cr,(p) = 2 + tp(a- 1) -<r(p- 1) 

(2.366) 

= 2 + (7 — tp . 

Since d PE > 0, we must have tp < a + 2. Thus, with two species (a = 2), we could have at most four phase 
coexistence. 

If the various species can undergo p distinct chemical reactions of the form 

C[ r} A, + C { 2 r} A 2 + • • • + C« K = , (2.367) 

where A a is the chemical formula for species a, and Q r ' is the stoichiometric coefficient for the a th species in the 
r th reaction, with r = 1 , . . . , p, then we have an additional p constraints of the form 

E^V^= 0. (2.368) 

a=l 

Therefore, 

d PE (a,ip,p)=2 + a-(p-p. (2.369) 

One might ask what value of j are we to use in eqn. 2.368, or do we in fact have ip such equations for each r? The 
answer is that eqn. 2.365 guarantees that the chemical potential of species a is the same in all the phases, hence it 
doesn't matter what value one chooses for j in eqn. 2.368. 

Let us assume that no reactions take place, i.e. p = 0, so the total number of particles Ylh=i ^ s conserved. 
Instead of choosing (T,p, fj,^, . . . , fJ^_i) as d TD intensive variables, we could have chosen (T,p, p 1 , . . . , a;*?2i)/ where 
x a = N a /N is the concentration of species a. 

Why do phase diagrams in the (p, v) and (T, v) plane look different than those in the (p, T) plane? 17 For example, 
Fig. 2.27 shows projections of the p-v-T surface of a typical single component substance onto the (T, v), (p, v), and 
{p, T) planes. Coexistence takes place along curves in the (p, T) plane, but in extended two-dimensional regions 
in the (T, v) and (p, v) planes. The reason that p and T are special is that temperature, pressure, and chemical 
potential must be equal throughout an equilibrium phase if it is truly in thermal, mechanical, and chemical equi- 
librium. This is not the case for an intensive variable such as specific volume v = N A V/N or chemical concentration 
x a = NJN. 



16 Set j = l and let j' range over the tp — 1 values 2, .. . ,ip. 

17 The same can be said for multicomponent systems: the phase diagram in the (T, x) plane at constant p looks different than the phase 
diagram in the (T, (i) plane at constant p. 
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2.13 Entropy of Mixing and the Gibbs Paradox 
2.13.1 Computing the entropy of mixing 

Entropy is widely understood as a measure of disorder. Of course, such a definition should be supplemented by 
a more precise definition of disorder - after all, one man's trash is another man's treasure. To gain some intuition 
about entropy let us explore the mixing of a multicomponent ideal gas. Let N = J2 a N a be the total number of 
particles of all species, and let x a = N a /N be the concentration of species a. Note that J2 a x a = 1- 

For any substance obeying the ideal gas law pV = Nk B T, the entropy is 

S(T, V, N) = Nk B \n(V/N) + N<f>(T) , (2.370) 

since (ff) = (§r) v N = ht 1 - Note that in eqn. 2.370 we have divided V by N before taking the logarithm. 
This is essential if the entropy is to be an extensive function (see §2.7.5). One might think that the configurational 
entropy of an ideal gas should scale as ln(V ) = N In V, since each particle can be anywhere in the volume 
V. However, if the particles are indistinguishable, then permuting the particle labels does not result in a distinct 
configuration, and so the configurational entropy is proportional to \n(V N /N\) ~ N\n(V/N) — N. The origin 
of this indistinguishability factor will become clear when we discuss the quantum mechanical formulation of 
statistical mechanics. For now, note that such a correction is necessary in order that the entropy be an extensive 
function. 

If we did not include this factor and instead wrote S* (T, V, N) = Nk B In V+N(j)(T), then we would find S* (T, V, N) - 
2S* (T, i V, t^N) = Nk B In 2, i.e. the total entropy of two identical systems of particles separated by a barrier will in- 
crease if the barrier is removed and they are allowed to mix. This seems absurd, though, because we could just as 
well regard the barriers as invisible. This is known as the Gibbs paradox. The resolution of the Gibbs paradox is to 
include the indistinguishability correction, which renders S extensive, in which case S(T, V, N) — 2S(T, \ V, ^N). 

Consider now the situation in Fig. 2.31, where we have separated the different components into their own volumes 
V a . Let the pressure and temperature be the same everywhere, so pV a = N a k B T. The entropy of the unmixed 
system is then 

(2.371) 



'unmixed 



Y, S a = Y, K K In (V a /JV ) + N a 4> a (T) 



Now let us imagine removing all the barriers separating the different gases and letting the particles mix thor- 
oughly. The result is that each component gas occupies the full volume V, so the entropy is 



^mixed / j a a / j 
a a 

Thus, the entropy of mixing is 

AC — q Q 

mix mixed unmixed 



= V S„ = V [N a k B ln(V/N a ) + N a d> a (T)] . (2.372) 



= N a K HV/V a ) - -Nk B J2 *a lnx Q , ( 2373 > 

a a 

where x a = -^p- = ^ is the fraction of species a. Note that AS^^ = 0. 

What if all the components were initially identical? It seems absurd that the entropy should increase simply by 
removing some invisible barriers. This is again the Gibbs paradox. In this case, the resolution of the paradox is to 
note that the sum in the expression for S , mixod is a sum over distinct species. Hence if the particles are all identical, 
we have S mixcd = Nk B ln(V/N) + N</>(T) = S unmixcd , hence A5 mix = 0. 
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Figure 2.31: A multicomponent system consisting of isolated gases, each at temperature T and pressure p. Then 
system entropy increases when all the walls between the different subsystems are removed. 



2.13.2 Entropy and combinatorics 

As we shall learn when we study statistical mechanics, the entropy may be interpreted in terms of the number of 
ways W(E, V, N) a system at fixed energy and volume can arrange itself. One has 

S(E, V, N) = k B In W(E, V, N) . (2.374) 

Consider a system consisting of a different species of particles. Now let it be that for each species label a, N a 
particles of that species are confined among Q a little boxes such that at most one particle can fit in a box (see Fig. 
2.32). How many ways W are there to configure N identical particles among Q boxes? Clearly 

w -Q-im^m- (2375) 

Were the particles distinct, we'd have W^istinct = (q-n)\ ' which is N\ times greater. This is because permuting 
distinct particles results in a different configuration, and there are Nl ways to permute N particles. 

The entropy for species a is then S a = k B In W a = k B In ^ a J . We then use Stirling's approximation, 

ln(K\) = K In K -K+\ hxK + \ ln(27r) + C(A' _1 ) , (2.376) 
which is an asymptotic expansion valid for K >> 1. One then finds for Q,N ^> 1, with x = N/Q £ [0, 1], 

(iv) = ( Q ln Q ~ Q ) ~ ( XQ ln(;C<5) -xQ)-({1- x)Q In ((1 - x)Q) - (1 - x)Q) 



In 



= -Q 



x \nx + (1 — x) ln(l — x) 



(2.377) 



This is valid up to terms of order Q in Stirling's expansion. Since In Q -C Q, the next term is small and we are safe 
to stop here. Summing up the contributions from all the species, we get 



a a 
-^unmixed = K hlW a = ~ fc B Qa [ X a lnX a + i 1 ~ X a) H 1 ~ % a) 



a=l 



(2.378) 
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Figure 2.32: Mixing among three different species of particles. The mixed configuration has an additional entropy, 

A^mix- 



where x a = N a /Q a is the initial dimensionless density of species a. 

Now let's remove all the partitions between the different species so that each of the particles is free to explore all 
of the boxes. There are Q = J2 a Qa boxes in all. The total number of ways of placing N 1 particles of species a = 1 
through N a particles of species a is 

where N = Q — Y^a=i * s ^ e number of vacant boxes. Again using Stirling's rule, we find 

(7 

■Smixod = -* B Q X] 5 « ' (2.380) 

a=0 

where x a = N a /Q is the fraction of all boxes containing a particle of species a, and N Q is the number of empty 
boxes. Note that 

N NO 

where /„ s QJQ. Note that /„ = 1. 

Let's assume all the densities are initially the same, so x a = xVa, sox a = x f a . In this case, / a = q = Tv" is the 
fraction of species a among all the particles. We then have x Q = 1 — x, and 

(7 

"Smixcd = — Q x fa ^ n ( x fa) ^^bQ^O m x 

a=1 a (2.382) 

= -fc B Q[ajlnaj+(l-aj)ln(l-x) - fc B x Q ^ / In / Q . 

0=1 

Thus, the entropy of mixing is 

A5 mix = -iVfc B ^/ a ln/ aJ (2.383) 

a=l 

where N = X)a=i -^o * s me t°t a l number of particles among all species (excluding vacancies) and f a = N a / (N + 
N )is the fraction of all boxes occupied by species a. 
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2.13.3 Weak solutions and osmotic pressure 



Suppose one of the species is much more plentiful than all the others, and label it with a = 0. We will call this the 
solvent. The entropy of mixing is then 



(2.384) 



where N' = Y^a=i ^ s the total number of solvent molecules, summed over all species. We assume the solution 
is weak, which means N a < N' <C N . Expanding in powers of N'/N and N a /N , we find 



a=l 



+ O(N' 2 /N ). (2.385) 



Consider now a solution consisting of N molecules of a solvent and N a molecules of species a of solute, where 
a = 1, . . . , a. We begin by expanding the Gibbs free energy G(T,p 7 N Q , 7V 1; . . . , N a ), where there are a species of 
solutes, as a power series in the small quantities N a . We have 

G(T, p, N , {N a }) = N g (T, p) + k B T £ N a ln( 

1 ° (2.386) 

+ N a Tp a (T,p) + ^y- E A ab(T,p) N a N b . 
a a ,b 

The first term on the RHS corresponds to the Gibbs free energy of the solvent. The second term is due to the 
entropy of mixing. The third term is the contribution to the total free energy from the individual species. Note 
the factor of e in the denominator inside the logarithm, which accounts for the second term in the brackets on the 
RHS of eqn. 2.385. The last term is due to interactions between the species; it is truncated at second order in the 
solute numbers. 

The chemical potential for the solvent is 

Vo(T,p) = Qjjr = 9o(T,p) - k B T"Y x a ~ h^2 A ab( T ^P)x a x b , (2.387) 

a a.b 

and the chemical potential for species a is 

Hafrp) = TjTj- = KTlnx a + i> a (T,p) + J2Ab(T, P )x b , (2.388) 

a b 

where x a = N a /N is the concentrations of solute species a. By assumption, the last term on the RHS of each of 
these equations is small, since N soluto ./V , where iV solutG = Yla=i N a i s m e total number of solute molecules. To 
lowest order, then, we have 

Mo (7»= 3o (7»-xfc B T (2.389) 
p a {T,p) = k B T\nx a + V^(T,p) , (2.390) 

where x = x a ^ s me total solute concentration. 

If we add sugar to a solution confined by a semipermeable membrane 18 , the pressure increases! To see why, 
consider a situation where a rigid semipermeable membrane separates a solution (solvent plus solutes) from a 



^Semipermeable' in this context means permeable to the solvent but not the solute(s). 
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■ 



Figure 2.33: Osmotic pressure causes the column on the right side of the U-tube to rise higher than the column on 
the left by an amount Ah = tt/ gg. 



pure solvent. There is energy exchange through the membrane, so the temperature is T throughout. There is no 
volume exchange, however: dV = dV' = 0, hence the pressure need not be the same. Since the membrane is 
permeable to the solvent, we have that the chemical potential fj, is the same on each side. This means 

9o(T,P R ) ~ xk B T = g (T,pA , (2.391) 

where p L R is the pressure on the left and right sides of the membrane, and x = N/N is again the total solute 
concentration. This equation once again tells us that the pressure p cannot be the same on both sides of the 
membrane. If the pressure difference is small, we can expand in powers of the osmotic pressure, tt = p B — p L , and 
we find 



( ^r) ■ ( 2 - 392 ) 



But a Maxwell relation (§2.9) guarantees 



dp,\ f dV 

d P/T,N \ dN /l 



dp J 7 

v(T,p)/N A , (2.393) 



where v(T,p) is the molar volume of the solvent. 



nv = xRT , (2.394) 



which looks very much like the ideal gas law, even though we are talking about dense (but 'weak') solutions! The 
resulting pressure has a demonstrable effect, as sketched in Fig. 2.33. Consider a solution containing v moles of 
sucrose (C 12 H 22 11 ) per kilogram (55.52 mol) of water at 30° C. We find tt = 2.5 atm when v = 0.1. 

One might worry about the expansion in powers of ix when 7r is much larger than the ambient pressure. But in 
fact the next term in the expansion is smaller than the first term by a factor of ttk t , where k t is the isothermal 
compressibility. For water one has k t « 4.4 x 10~ 5 (atm) -1 , hence we can safely ignore the higher order terms in 
the Taylor expansion. 



2.13.4 Effect of impurities on boiling and freezing points 

Along the coexistence curve separating liquid and vapor phases, the chemical potentials of the two phases are 
identical: 

lil(T,p) = £(T,p). (2.395) 
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2270 
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Table 2.4: Latent heats of fusion and vaporization at p = 1 atm. 



Here we write fi° for fi to emphasize that we are talking about a phase with no impurities present. This equation 
provides a single constraint on the two variables T and p, hence one can, in principle, solve to obtain T = T * (p), 
which is the equation of the liquid-vapor coexistence curve in the (T,p) plane. Now suppose there is a solute 
present in the liquid. We then have 

^(T,p,x) = fi(T,p) - xk B T , (2.396) 

where x is the dimensionless solute concentration, summed over all species. The condition for liquid-vapor coex- 
istence now becomes 

f j°(T,p)-xk B T = fi(T,p). (2.397) 

This will lead to a shift in the boiling temperature at fixed p. Assuming this shift is small, let us expand to lowest 
order in (T - T * (p)) , writing 



Note that 

'S/A _ fdS\ 



(2.398) 



8TJ \dNj (2399) 

U1 Jp,N \ uly /T,p 

from a Maxwell relation deriving from exactness of dG. Since S is extensive, we can write S = (N/N A ) s(T,p), 
where s(T,p) is the molar entropy. Solving for T, we obtain 

T* (p, x) = T* (p) + xR ^f 2 , (2.400) 

where £ v = T * • (s v — s L ) is the latent heat of the liquid-vapor transition 19 . The shift AT* = T* — T * is called the 
boiling point elevation. 

As an example, consider seawater, which contains approximately 35 g of dissolved Na + Cl~ per kilogram of H 2 0. 
The atomic masses of Na and CI are 23.0 and 35.4, respectively, hence the total ionic concentration in seawater 
(neglecting everything but sodium and chlorine) is given by 

2 ' 35 /100 ° 0.022. (2.401) 



23.0 + 35.4/ 18 



'We shall discuss latent heat again in §2.12.2 below. 
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The latent heat of vaporization of H 2 at atmospheric pressure is £ = 40.7 kJ/mol, hence 

AT* = (0-022)(8.3J/inolK)(373K)' 

4.1 x 10 4 J/mol v ; 

Put another way, the boiling point elevation of H 2 at atmospheric pressure is about 0.28°C per percent solute. 
We can express this as AT* = Km, where the molality m is the number of moles of solute per kilogram of solvent. 
For H 2 0, we find K = 0.51°C kg/mol. 

Similar considerations apply at the freezing point, when we equate the chemical potential of the solvent plus 
solute to that of the pure solid. The latent heat of fusion for H 2 is about £ f = T f ° ■ (s LIQUID — s SOLID ) = 6.01 kJ/mol 20 

We thus predict a freezing point depression of AT* = —xR[Tq] 2 /£ { = 1.03°C • x[%}. This can be expressed once 
again as AT* = -Km, with K = 1.86°C kg/mol 21 . 



2.13.5 Binary solutions 

Consider a binary solution, and write the Gibbs free energy G(T, p, N A , N B ) as 
G(T 7 p, N A , N B ) = N A p° A (T,p) + N B M ° B (T,p) + N A k B T In 



N A + N R 

' A B/ (2.403) 

\N A + N B J N A + N B 

The first four terms on the RHS represent the free energy of the individual component fluids and the entropy of 
mixing. The last term is an interaction contribution. With A > 0, the interaction term prefers that the system be 
either fully A or fully B. The entropy contribution prefers a mixture, so there is a competition. What is the stable 
thermodynamic state? 

It is useful to write the Gibbs free energy per particle, g(T, p, x) = G/(N A + N B ), in terms of T, p, and the concen- 
tration x = x B — N B /(N A + N B ) of species B (hence x A = 1 — x is the concentration of species A). Then 

g(T,p,x) = (l-x)p° A + xp B + k B T x In x + (1 - x) ln(l - x) +Xx(l-x) . (2.404) 

In order for the system to be stable against phase separation into relatively A-rich and B-rich regions, we must have 
that g(T, p, x) be a convex function of x. Our first check should be for a local instability, i.e. spinodal decomposition. 
We have 

|f = Mb - Ma + KT In (j^) + A ^ ~ 2x ) ( Z405 ) 

and 

d 2 g k B T k B T ^ 

^4 = + T- 2 2A • (2.406) 

ox z x 1 — x 

The spinodal is given by the solution to the equation = 0, which is 

2A 

T*(x) = — x(l -x) . (2.407) 

k B 

Since x (1 — x) achieves its maximum value of 4 at x = \, we have T* < k B /2X. 



20 See table 2.4, and recall M = 18 g is the molar mass of H 2 0. 

21 It is more customary to write AT* = T* e solvont — Solution ™ tne case °^ tne f reezm g point depression, in which case AT* is positive. 
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0.2 0.4 0.6 0.8 1 
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Figure 2.34: Gibbs free energy per particle for a binary solution as a function of concentration x = xb of the B 
species (pure A at the left end x = ; pure B at the right end x = 1), in units of the interaction parameter A. Dark 
red curve: T = 0.65 A/fc B > T c ; green curve: T = X/2k B = T c ; blue curve: T = 0.40 A/fc B < T c . We have chosen 
fxl = 0.60 A - 0.50 k B T and ^ = 0-50 A — 0. 50 k B T. Note that the free energy g(T,p, x) is not convex in x for 
T < T c , indicating an instability and necessitating a Maxwell construction. 



In Fig. 2.34 we sketch the free energy g(T,p, x) versus x for three representative temperatures. For T > A/2fc B , the 
free energy is everywhere convex in A. When T < A/2fc B/ there free energy resembles the blue curve in Fig. 2.34, 
and the system is unstable to phase separation. The two phases are said to be immiscible, or, equivalently, there 
exists a solubility gap. To determine the coexistence curve, we perform a Maxwell construction, writing 



gQgg) ~ g(gi) = 



dg 

dx 



(2.408) 



Here, x 1 and x 2 are the boundaries of the two phase region. These equations admit a symmetry of x *-> 1 — x, 
hence we can set x = x x and x 2 = 1 — x. We find 

3 (1 -x)- g(x) = (1 - 2x) {f4 - ft) , (2.409) 

and invoking eqns. 2.408 and 2.405 we obtain the solution 

The phase diagram for the binary system is shown in Fig. 2.36. For T < T*(x), the system is unstable, and 
spinodal decomposition occurs. For T* (x) < T < T cocx (x), the system is metastable, just like the van der Waals gas 
in its corresponding regime. Real binary solutions behave qualitatively like the model discussed here, although 
the coexistence curve is generally not symmetric under x <-> 1 — x, and the single phase region extends down to 
low temperatures for x ~ and x w 1. If A itself is temperature-dependent, there can be multiple solutions to 
eqns. 2.407 and 2.410. For example, one could take 

A(T) = T^Sf • (Z411) 
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Figure 2.35: Upper panels: chemical potential shifts A[i± = A^a ± A^b versus concentration x = x B . The dashed 
black line is the spinodal, and the solid black line the coexistence boundary. Temperatures range from T = (dark 
blue) to T = 0.6 X/k B (red) in units of 0.1 X/k B . Lower panels: phase diagram in the (T, A/x±) planes. The black 
dot is the critical point. 



In this case, k B T > A at both high and low temperatures, and we expect the single phase region to be reentrant. 
Such a phenomenon occurs in water-nicotine mixtures, for example. 



It is instructive to consider the phase diagram in the (T, /.t) plane. We define the chemical potential shifts, 

A/i A = /Lt A - /i A = fc B Tln(l - x) + Xx 2 



Mb 



Mb 



k B T\nx + A(l -xf 



(2.412) 
(2.413) 



and their sum and difference, A/i ± = A/x A ± A^ B . From the Gibbs-Duhem relation, we know that we can write fi B 
as a function of T, p, and fj, A . Alternately, we could write A/x ± in terms of T, p, and A^ T , so we can choose which 
among A/i + and AyU_ we wish to use in our phase diagram. The results are plotted in Fig. 2.35. It is perhaps 
easiest to understand the phase diagram in the (T, A/x_) plane. At low temperatures, below T = T c = X/2k B , 
there is a first order phase transition at A/i_ = 0. For T < T c = X/2k B and A/x_ = + , i.e. infinitesimally positive, 
the system is in the A-rich phase, but for A/i_ = 0~, i.e. infinitesimally negative, it is B-rich. The concentration 
x = x B changes discontinuously across the phase boundary. The critical point lies at (T, Afi_) = (X/2k B , 0). 

If we choose N = N A + N B to be the extensive variable, then fixing N means dN A + dN B = 0. So st fixed T and p, 

dG\ Tp = n A dN A + ti B dN B => dg\ Tp = -A l x_dx . (2.414) 
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Figure 2.36: Phase diagram for the binary system. The black curve is the coexistence curve, and the dark red curve 
is the spinodal. A-rich material is to the left and B-rich to the right. 

Since Afi_(x,T) = <p(x,T) - <p(l - x,T) = -A/i_(l - x,T), where <p(x,T) = Xx - k B T\nx, we have that the 

1-3 

coexistence boundary in the (x, A_ ) plane is simply the line A/i_ = 0, because Jdx' A//_ (x' , T) = 0. 

Note also that there is no two-phase region in the (T, A/i) plane; the phase boundary in this plane is a curve 
which terminates at a critical point. As we saw in §2.12, the same situation pertains in single component (p, v, T) 
systems. That is, the phase diagram in the (p, v) or (T, v) plane contains two-phase regions, but in the (p, T) plane 
the boundaries between phases are one-dimensional curves. Any two-phase behavior is confined to these curves, 
where the thermodynamic potentials are singular. 

The phase separation can be seen in a number of systems. A popular example involves mixtures of water and 
ouzo or other anise-based liqueurs, such as arak and absinthe. Starting with the pure liqueur (x = 1), and at a 
temperature below the coexistence curve maximum, the concentration is diluted by adding water. Follow along 
on Fig. 2.36 by starting at the point (x = 1 , k^T/X = 0.4) and move to the left. Eventually, one hits the boundary 
of the two-phase region. At this point, the mixture turns milky, due to the formation of large droplets of the pure 
phases on either side of coexistence region which scatter light, a process known as spontaneous emulsification 22 . As 
one continues to dilute the solution with more water, eventually one passes all the way through the coexistence 
region, at which point the solution becomes clear once again, and described as a single phase. 

What happens if A < ? In this case, both the entropy and the interaction energy prefer a mixed phase, and there is 
no instability to phase separation. The two fluids are said to be completely miscible. An example would be benzene, 
C 6 H 6 , and toluene, C 7 H g (i.e. C 6 H 5 CH 3 ). The phase diagram would be blank, with no phase boundaries below 
the boiling transition, because the fluid could exist as a mixture in any proportion. 

Any fluid will eventually boil if the temperature is raised sufficiently high. Let us assume that the boiling points 
of our A and B fluids are T* B , and without loss of generality let us take T£ < Tg at some given fixed pressure 23 . 
This means fi\(T£,p) = fi^(T^,p) and p,q(Tq,p) = plq(Tq,p). What happens to the mixture? We begin by writing 



An emulsion is a mixture of two or more immiscible liquids. 
'We assume the boiling temperatures are not exactly equal! 
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Figure 2.37: Gibbs free energy per particle g for an ideal binary solution for temperatures T G [Tj^Tg]. The 
Maxwell construction is shown for the case < T < T B . Right: phase diagram, showing two-phase region and 
distillation sequence in (x, T) space. 



the free energies of the mixed liquid and mixed vapor phases as 

g^(T,p,x) = (l-x)ijfc(T,p)+xijfa(T,p) + k B T x\nx + (1 - x) ln(l - x) + A L x(l — x) (2.415) 
g v (T,p,x) = {l-x)fil(T,p) + xf4(T,p) + k n T\x\iLx+(l-x)lii(l-x)\ +\ Y x(l-x) . (2.416) 

Typically A v as 0. Consider these two free energies as functions of the concentration x, at fixed T and p. If the 
curves never cross, and g^{x) < g v (x) for all x G [0, 1], then the liquid is always the state of lowest free energy. 
This is the situation in the first panel of Fig. 2.37. Similarly, if g v (x) < g h (x) over this range, then the mixture is in 
the vapor phase throughout. What happens if the two curves cross at some value of x? This situation is depicted 
in the second panel of Fig. 2.37. In this case, there is always a Maxwell construction which lowers the free energy 
throughout some range of concentration, i.e. the system undergoes phase separation. 

In an ideal fluid, we have A L = A v = 0, and setting g L = g v requires 

(1 - x) A^ A (7» + xAn B (T,p) = , (2.417) 

where A / u A y B (T,p) = /i A , B (T,p) — fi^, B (T,p). Expanding the chemical potential about a given temperature T* , 



fx(T,p) = v{T*,p) - s(T*,p) (T - T*) - Cp{ ll ,P) (T - T*) 2 



2T 



(2.418) 



where we have used {^) pN = 
expanding A^ A/ , B about T£, B , we have 

AMA = MA-MX = (*A-*A)(r-TE 



{m) T , P = - S ( T >P)' the entropy per particle, and (§^) pN = c p /T. Thus, 



2T* 



(T-7%) 



A MB = /-ig - & = (si - S L B )(T - T B *) + CpB 2T y B (T - Tgf 

B 



(2.419) 
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Figure 2.38: Negative (left) and positive (right) azeotrope phase diagrams. From Wikipedia. 



We assume , B > sJy B , i.e. the vapor phase has greater entropy per particle. Thus, A/x A/ , B (T) changes sign from 
negative to positive as T rises through T^ B . If we assume that these are the only sign changes for Ap A ^ B (T) at 
fixed p, then eqn. 2.417 can only be solved for T € [T^, T B ]. This immediately leads to the phase diagram in the 
rightmost panel of Fig. 2.37. 

According to the Gibbs phase rule, with a = 2, two-phase equilibrium (tp = 2) occurs along a subspace of dimen- 
sion d PE = 2 + <t — ip = 2. Thus, if we fix the pressure p and the concentration x = x B , liquid-gas equilibrium 
occurs at a particular temperature T* , known as the boiling point. Since the liquid and the vapor with which it 
is in equilibrium at T* may have different composition, i.e. different values of x, one may distill the mixture to 
separate the two pure substances, as follows. First, given a liquid mixture of A and B, we bring it to boiling, as 
shown in the rightmost panel of Fig. 2.37. The vapor is at a different concentration x than the liquid (a lower value 
of x if the boiling point of pure A is less than that of pure B, as shown). If we collect the vapor, the remaining fluid 
is at a higher value of x. The collected vapor is then captured and then condensed, forming a liquid at the lower 
x value. This is then brought to a boil, and the resulting vapor is drawn off and condensed, etc The result is a 
purified A state. The remaining liquid is then at a higher B concentration. By repeated boiling and condensation, 
A and B can be separated. For liquid-vapor transitions, the upper curve, representing the lowest temperature at a 
given concentration for which the mixture is a homogeneous vapor, is called the dew point curve. The lower curve, 
representing the highest temperature at a given concentration for which the mixture is a homogeneous liquid, 
is called the bubble point curve. The same phase diagram applies to liquid-solid mixtures where both phases are 
completely miscible. In that case, the upper curve is called the liquidus, and the lower curve the solidus. 

When a homogeneous liquid or vapor at concentration x is heated or cooled to a temperature T such that (x, T) 
lies within the two-phase region, the mixture phase separates into the the two end components (x* , T) and (x* , T), 
which lie on opposite sides of the boundary of the two-phase region, at the same temperature. The locus of points 
at constant T joining these two points is called the tie line. To determine how much of each of these two homo- 
geneous phases separates out, we use particle number conservation. If 7? L v is the fraction of the homogeneous 
liquid and homogeneous vapor phases present, then r/ L x* + ?y v a;* = x, which says r/ L = (x — )/(x* — x^) and 
Vv = ( x — x *)/( x v ~ x *)- This is known as the lever rule. 
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Figure 2.39: Free energies before Maxwell constructions for a binary fluid mixture in equilibrium with a vapor 
(A v = 0). Panels show (a) A L = (ideal fluid), (b) A L < (miscible fluid; negative azeotrope), (c) A^ B > 
(positive azeotrope), (d) A^ B > (heteroazeotrope). Thick blue and red lines correspond to temperatures T£ and 
Tg, respectively, with < Xg\ Thin blue and red curves are for temperatures outside the range [T^,Tg]. The 
black curves show the locus of points where g is discontinuous, i.e. where the liquid and vapor free energy curves 
cross. The yellow curve in (d) corresponds to the coexistence temperature for the fluid mixture. In this case the 
azeotrope forms within the coexistence region. 



For many binary mixtures, the boiling point curve is as shown in Fig. 2.38. Such cases are called azeotropes. For 
negative azeotropes, the maximum of the boiling curve lies above both B . The free energy curves for this case 
are shown in panel (b) of Fig. 2.39. For x < x* , where x* is the azeotropic composition, one can distill A but 
not B. Similarly, for x > x* one can distill B but not A. The situation is different for positive azeotropes, where the 
minimum of the boiling curve lies below both B , corresponding to the free energy curves in panel (c) of Fig. 
2.39. In this case, distillation (i.e. condensing and reboiling the collected vapor) from either side of x* results in 
the azeotrope. One can of course collect the fluid instead of the vapor. In general, for both positive and negative 
azeotropes, starting from a given concentration x, one can only arrive at pure A plus azeotrope (if x < x*) or 
pure B plus azeotrope (if x > x*). Ethanol (C 2 H 5 OH) and water (H 2 0) form a positive azeotrope which is 95.6% 
ethanol and 4.4% water by weight. The individual boiling points are T£ H OH = 78.4°C, T£ = 100°C, while 
the azeotrope boils at T* z = 78. 2° C. No amount of distillation of this mixture can purify ethanol beyond the 
95.6% level. To go beyond this level of purity, one must resort to azeotropic distillation, which involved introducing 
another component, such as benzene (or a less carcinogenic additive), which alters the molecular interactions. 
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Figure 2.40: Phase diagram for a eutectic mixture in which a liquid L is in equilibrium with two solid phases a and 
(3. The same phase diagram holds for heteroazeotropes, where a vapor is in equilibrium with two liquid phases. 



To model the azeotrope system, we need to take A L ^ 0, in which case one can find two solutions to the energy 
crossing condition g v (x) = g L (x). With two such crossings come two Maxwell constructions, hence the phase dia- 
grams in Fig. 2.38. Generally, negative azeotropes are found in systems with A L < , whereas positive azeotropes 
are found when A L > 0. As we've seen, such repulsive interactions between the A and B components in general 
lead to a phase separation below a coexistence temperature T COEX (a;) given by eqn. 2.410. What happens if the 
minimum boiling point lies within the coexistence region? This is the situation depicted in panel (d) of Fig. 2.39. 
The system is then a liquid/vapor version of the solid/liquid eutectic (see Fig. 2.40), and the minimum boiling 
point mixture is called a heteroazeotrope. 



2.14 Some Concepts in Thermochemistry 
2.14.1 Chemical reactions and the law of mass action 

Suppose we have a chemical reaction among a species, written as 

d^+GA^-. + CA^O, (2.420) 

where 

A a = chemical formula 
C a = stoichiometric coefficient . 

For example, we could have 

-3H 2 -N 2 + 2NH 3 = (3H 2 +N 2 ^ 2NH 3 ) (2.421) 

for which 

C(H 2 ) = -3 , C(N 2 ) = -1 , C(NH 3 ) = 2. (2.422) 
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When C a > 0, the corresponding A a is a product; when £ a < 0, the corresponding A a is a reactant. The bookkeeping 
of the coefficients ( a which ensures conservation of each individual species of atom in the reaction(s) is known as 
stoichiometry 24 

Now we ask: what are the conditions for equilibrium? At constant T and p, which is typical for many chemical 
reactions, the conditions are that G(T,p, {N a }) be a minimum. Now 

dG = -S dT + V dp + J2 dN a , ( 2 - 42 3) 

i 

so if we let the reaction go forward, we have dN a = ( a , and if it runs in reverse we have dN a = — ( a . Thus, setting 
dT = dp = 0, we have the equilibrium condition 

(T 

ECMa = 0. (2.424) 

a=l 

Let us investigate the consequences of this relation for ideal gases. The chemical potential of the a" 1 species is 

H a (T,p) = k B Tcp a (T) + KT\np a . (2.425) 

Here p a = px a is the partial pressure of species a, where x a = N a /J2b^b m e dimensionless concentration of 
species a. Chemists sometimes write x a = [A a ] for the concentration of species a. In equilibrium we must have 



Y,(a[^p + lnx a + ( f> a (T) 

a 

which says 



, (2.426) 



a a a 

Exponentiating, we obtain the law of mass action: 

J] *£« = p- C exp L J2 C a K (T)) = «(p, T) . (2.428) 



The quantity n(p, T) is called the equilibrium constant. When k is large, the LHS of the above equation is large. 
This favors maximal concentration x a for the products (C a > 0) and minimal concentration x a for the reactants 
(C Q < 0). This means that the equation REACTANTS ^ PRODUCTS is shifted to the right, i.e. the products are 
plentiful and the reactants are scarce. When k is small, the LHS is small and the reaction is shifted to the left, i.e. 
the reactants are plentiful and the products are scarce. Remember we are describing equilibrium conditions here. 
Now we observe that reactions for which ^ a £ a > shift to the left with increasing pressure and shift to the right 



24 Antoine Lavoisier, the "father of modern chemistry", made pioneering contributions in both chemistry and biology. In particular, he is 
often credited as the progenitor of stoichiometry. An aristocrat by birth, Lavoisier was an administrator of the Ferme generate, an organization 
in pre-revolutionary France which collected taxes on behalf of the king. At the age of 28, Lavoisier married Marie-Anne Pierette Paulze, the 
13-year-old daughter of one of his business partners. She would later join her husband in his research, and she played a role in his disproof of 
the phlogiston theory of combustion. The phlogiston theory was superseded by Lavoisier's work, where, based on contemporary experiments 
by Joseph Priestley, he correctly identified the pivotal role played by oxygen in both chemical and biological processes (i.e. respiration). Despite 
his fame as a scientist, Lavoisier succumbed to the Reign of Terror. His association with the Ferme generate, which collected taxes from the poor 
and the downtrodden, was a significant liability in revolutionary France (think Mitt Romney vis-a-vis Bain Capital). Furthermore - and let 
this be a lesson to all of us - Lavoisier had unwisely ridiculed a worthless pseudoscientific pamphlet, ostensibly on the physics of fire, and 
its author, Jean-Paul Marat. Marat was a journalist with scientific pretensions, but apparently little in the way of scientific talent or acumen. 
Lavoisier effectively blackballed Marat's candidacy to the French Academy of Sciences, and the time came when Marat sought revenge. Marat 
was instrumental in getting Lavoisier and other members of the Ferme generate arrested on charges of counterrevolutionary activities, and on 
May 8, 1794, after a trial lasting less than a day, Lavoisier was guillotined. Along with Fourier and Carnot, Lavoisier's name is one of the 72 
engraved on the Eiffel Tower. Source: http://www.vigyanprasar.gov.in/scientists/ALLavoisier.htm. 
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with decreasing pressure, while reactions for which ^ Q ( a > the situation is reversed: they shift to the right with 
increasing pressure and to the left with decreasing pressure. When J2 a C a = there is no shift upon increasing or 
decreasing pressure. 

The rate at which the equilibrium constant changes with temperature is given by 

<9 In « 



v 



dT 

Now from eqn. 2.425 we have that the enthalpy per particle for species i is 



= -£C„&CT). (2.429) 



dT 



since H = G + TS and S = — (fp 1. We find 

h (l = -fc B T 2 ^(T), (2.431) 

and thus 



01n K \ _ EiC a h„ _ Ah 



dT J p k B T? k B T? ' 
where Ah is the enthalpy of the reaction, which is the heat absorbed or emitted as a result of the reaction. 



(2.432) 



When Ah > the reaction is endothermic and the yield increases with increasing T. When Ah < the reaction is 
exothermic and the yield decreases with increasing T. 

As an example, consider the reaction H 2 + I 2 ^ 2 HI. We have 

C(H 2 ) = -1 , C(I 2 ) = -1 C(HI) = 2. (2.433) 

Suppose our initial system consists of v\ moles of H 2 , i/ 2 = moles of I 2 , and v\ moles of undissociated HI . These 
mole numbers determine the initial concentrations x®, where x a = v a j J^b v b- Define 

a = , (2.434) 



in which case we have 



Then the law of mass action gives 



4(1 -a) 2 



x 3 = (l-a)x° 3 . (2.435) 



(2.436) 



a(a + 2r) 

where r = x^/x® = v\jv^. This yields a quadratic equation, which can be solved to find a(n, r). Note that 
k = k(T) for this reaction since J2a Ca = 0. The enthalpy of this reaction is positive: Ah > 0. 

2.14.2 Enthalpy of formation 

Most chemical reactions take place under constant pressure. The heat Q u associated with a given isobaric process 
is 



Qx = JdE + JpdV = (E f - E { ) +p( Vi -V i ) = H i -H i , 
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AHf 

1 








AH' J 

1 


Formula 


Name 


State 


kJ/mol 


Formula 


Name 


State 


kJ/mol 


Ag 


Silver 


crystal 


0.0 


NiS0 4 


Nickel sulfate 


crystal 


-872.9 


C 


Graphite 


crystal 


0.0 


A1 2 3 


Aluminum oxide 


crystal 


-1657.7 


C 


Diamond 


crystal 


1.9 


Ca 3 P 2 8 


Calcium phosphate 


gas 


-4120.8 


3 


Ozone 


gas 


142.7 


HCN 


Hydrogen cyanide 


liquid 


108.9 


H 2 


Water 


liquid 


-285.8 


SF 6 


Sulfur hexafluoride 


gas 


-1220.5 


H3BO3 


Boric acid 


crystal 


-1094.3 


CaF 2 


Calcium fluoride 


crystal 


-1228.0 


ZnSQ 4 


Zinc sulfate 


crystal 


-982.8 


CaCl 2 


Calcium chloride 


crystal 


-795.4 



Table 2.5: Enthalpies of formation of some common substances. 



where H is the enthalpy, 

H = E + P V . (2.438) 

Note that the enthalpy H is a state function, since E is a state function and p and V are state variables. Hence, 
we can meaningfully speak of changes in enthalpy: AH = H f — Hj. If AH < for a given reaction, we call 
it exothermic - this is the case when Q if < and thus heat is transferred to the surroundings. Such reactions 
can occur spontaneously, and, in really fun cases, can produce explosions. The combustion of fuels is always 
exothermic. If AH > 0, the reaction is called endothermic. Endothermic reactions require that heat be supplied in 
order for the reaction to proceed. Photosynthesis is an example of an endothermic reaction. 



Suppose we have two reactions 
and 

Then we may write 
with 



A + B (AH)l » C (2.439) 
C + D (AH)2 ; E . (2.440) 
A + B + D (AH)a ) E , (2.441) 



(AH)! + (AH) 2 = (AH) 3 . (2.442) 

We can use this additivity of reaction enthalpies to define a standard molar enthalpy of formation. We first define the 
standard state of a pure substance at a given temperature to be its state (gas, liquid, or solid) at a pressure p = 1 bar. 
The standard reaction enthalpies at a given temperature are then defined to be the reaction enthalpies when the 
reactants and products are all in their standard states. Finally, we define the standard molar enthalpy of formation 
AH° (X) of a compound X at temperature T as the reaction enthalpy for the compound X to be produced by its 
constituents when they are in their standard state. For example, if X = S0 2 , then we write 

S + 2 AH " 1S ° 21 . S0 2 . (2.443) 

The enthalpy of formation of any substance in its standard state is zero at all temperatures, by definition: AHf [0 2 ] = 
AH° [He] = AH°[K] = AH° [Mn] = 0,etc. 

Suppose now we have a reaction 

aA + bB AH , cC + dD. (2.444) 

To compute the reaction enthalpy AH, we can imagine forming the components A and B from their standard state 
constituents. Similarly, we can imagine doing the same for C and D. Since the number of atoms of a given kind is 
conserved in the process, the constituents of the reactants must be the same as those of the products, we have 

AH = -a AHf (A) - &AH?(S) + cAH?(C) + d AH?(£>) . (2.445) 
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< 
T. 



standard states 




react 



reactants 



(AH), 



prod 



products 



REACTION COORDINATE 



Figure 2.41: Left panel: reaction enthalpy and activation energy (exothermic case shown). Right panel: reaction 
enthalpy as a difference between enthalpy of formation of reactants and products. 

A list of a few enthalpies of formation is provided in table 2.5. Note that the reaction enthalpy is independent of 
the actual reaction path. That is, the difference in enthalpy between A and B is the same whether the reaction is 



A — > B or A — ► X 
Note that 

hence 

We therefore have 



(Y + Z) 



B. This statement is known as Hess's Law. 



dH = dE + pdV + V dp = dQ + V dp . 



dQ\ _ fdH 



dT 



1 

H(T,p,v) = H(T ,p,v) +ujdT' c p (T') 



(2.446) 
(2.447) 

(2.448) 



For ideal gases, we have c_(T) = (I+5/) R- For real gases, over a range of temperatures, there are small variations: 



c p (T) = a + f3T + 1 T 2 
Two examples (300 K < T < 1500 K, p = 1 atm): 



(2.449) 



2 
H 2 



a = 25.503 
a = 30.206 



mol K 
J 

mol K 



= 13.612 x 10" 
/? = 9.936 x 10~ : 



molK 2 



mol K 2 



7 = -42.553 x 10~ 7 
7 = 11.14 x 10~ 7 



molK 3 
J 



mol K 3 



If all the gaseous components in a reaction can be approximated as ideal, then we may write 

(AH) rxn = (A£) rxn + J2 C a RT , (2.450) 

a 

where the subscript 'rxn' stands for 'reaction'. Here (Ai£) rxn is the change in energy from reactants to products. 



2.14. SOME CONCEPTS IN THERMOCHEMISTRY 



97 



Doncl 


enthalpy 

(h-i / VT , n i'i 

^KJ / lllul j 


bond 


enthalpy 

(\r T /mr»l N l 
l KJ / I11U1 1 


bonci 


enthalpy 

fVT /rnnl^ 
^KJ / lllOl 1 


bond 


enthalpy 


H - H 


436 


c - c 


348 


c - s 


259 


F - F 


155 


H - C 


412 


c = c 


612 


N - N 


163 


F - CI 


254 


H - N 


388 


c = c 


811 


N = N 


409 


CI - Br 


219 


H — O 

-L-L V J 


463 


C — N 


305 


N = N 


945 


CI — T 


?10 


H-F 


565 


C = N 


613 


N-0 


157 


Cl-S 


250 


H-Cl 


431 


C = N 


890 


N-F 


270 


Br - Br 


193 


H-Br 


366 


C-0 


360 


N-Cl 


200 


Br-I 


178 


H-I 


299 


C = 


743 


N-Si 


374 


Br-S 


212 


H-S 


338 


C-F 


484 


0-0 


146 


I-I 


151 


H-P 


322 


C-Cl 


338 


= 


497 


s-s 


264 


H-Si 


318 


C-Br 


276 


-F 


185 


p - p 


172 






C-I 


238 


0-C1 


203 


Si — Si 


176 



Table 2.6: Average bond enthalpies for some common bonds. (Source: L. Pauling, The Nature of the Chemical Bond 
(Cornell Univ. Press, NY, I960).) 



2.14.3 Bond enthalpies 

The enthalpy needed to break a chemical bond is called the bond enthalpy, h [ • ] . The bond enthalpy is the energy 
required to dissociate one mole of gaseous bonds to form gaseous atoms. A table of bond enthalpies is given in 
Tab. 2.6. Bond enthalpies are endothermic, since energy is required to break chemical bonds. Of course, the actual 
bond energies can depend on the location of a bond in a given molecule, and the values listed in the table reflect 
averages over the possible bond environment. 

The bond enthalpies in Tab. 2.6 may be used to compute reaction enthalpies. Consider, for example, the reaction 
2 H 2 (g) + 2 (g) — ► 2 H 2 0(l). We then have, from the table, 

(AH) rxn = 2h[H-H] + h[0 = 0]-4h[H-0] 
= -483kJ/mol0 2 . 

Thus, 483 kj of heat would be released for every two moles of H 2 O produced, if the H 2 O were in the gaseous phase. 
Since H 2 is liquid at STP, we should also include the condensation energy of the gaseous water vapor into liquid 
water. At T = 100°C the latent heat of vaporization is I = 2270 J/g, but at T = 20°C, one has I = 2450 J/g, hence 
with M = 18 we have I = 44.1 kJ/mol. Therefore, the heat produced by the reaction 2 H 2 (g) + 2 (g) 2 H 2 0(l) 
is (AH) rxn = —571.2 kJ / mol 2 . Since the reaction produces two moles of water, we conclude that the enthalpy 
of formation of liquid water at STP is half this value: AH" [H 2 0] = 285.6 kJ/mol. 

Consider next the hydrogenation of ethene (ethylene): C 2 H 4 + H 2 C 2 H 6 . The product is known as ethane. 
The energy accounting is shown in Fig. 2.42. To compute the enthalpies of formation of ethene and ethane from 
the bond enthalpies, we need one more bit of information, which is the standard enthalpy of formation of C(g) 
from C(s), since the solid is the standard state at STP. This value is AHf [C(g)] = 718 kJ/mol. We may now write 

2C(g)+4H(g) ~ 2260kJ » C 2 H 4 (g) 

2C(s) 1436 kJ ; 2C(g) 

2H 2 (g) -^-^ 4H(g). 

Thus, using Hess's law, i.e. adding up these reaction equations, we have 

2C(s) + 2H 2 (g) ^ ► C 2 H 4 (g). 
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# 


bond 


cost (kj) 


total (kj) 


1 


H-H 


436 


436 


4 


H-C 


413 


1652 


1 


C=C 


614 


614 


TOTAL 


2696 



(AH), 



124 kJ 




H H H 
H H H 

C C 



(bond) 

prod 



# 


bond 


cost (kj) 


total (kj) 


1 


C-C 


348 


436 


6 


H-C 


413 


2478 


TOTAL 


2824 



Figure 2.42: Calculation of reaction enthalpy for the hydrogenation of ethene (ethylene), C2H4 



Thus, the formation of ethene is endothermic. For ethane, 

2C(g) + 6H(g) ~ 2820kJ 
2C(s) 
3H 2 (g) 



For ethane, 

which is exothermic. 



2C(s) + 3H 2 (g) 



1436 kj 



1306 kj 



— 76 kj 



C 2 H 6 (g) 

2C(g) 

6H(g) 

CMr) 



2.15 Appendix I : Integrating factors 



Suppose we have an inexact differential 

a\¥ = Ai dxi . (2.452) 

Here I am adopting the 'Einstein convention' where we sum over repeated indices unless otherwise explicitly 
stated; A i dx i = ^\ A i dx i . An integrating factor e L ^ is a function which, when divided into dF, yields an exact 
differential: 

dU 



du = e - L dw = 



dx. 



dX; . 



Clearly we must have 



d 2 U d 

dx i dx- dx i 



A A = 



dx. 



~ L A, 



Applying the Leibniz rule and then multiplying by e L yields 



d_A L _ A dL = dA 1 _ A dL_ 

dx { 3 dx i dxj 1 dxj 



(2.453) 
(2.454) 

(2.455) 
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If there are K independent variables {x 1 , . . . , x K }, then there are \K(K — 1) independent equations of the above 
form - one for each distinct pair. These equations can be written compactly as 



dL 
where 



Ux~ F «' (Z456) 



n ijk =A j 8 ik -A i 8 jk (2.457) 
dA- dA 

Note that is antisymmetric, and resembles a field strength tensor, and that J7^ fc = is antisymmetric in 

the first two indices (but is not totally antisymmetric in all three). 

Can we solve these \K(K — 1) coupled equations to find an integrating factor LI In general the answer is no. 
However, when K = 2 we can always find an integrating factor. To see why, let's call x = x 1 and y = x 2 - Consider 
now the ODE 

dy A x (x,y) 



dx A y (x,y) 



(2.459) 



This equation can be integrated to yield a one-parameter set of integral curves, indexed by an initial condition. 
The equation for these curves may be written as U c (x, y) = 0, where c labels the curves. Then along each curve 
we have 



Q = dU c = dU x | dU c dy 
dx dx dy dx 

= dU £ _A x dU £ 
dx A y dy 

Thus, 



(2.460) 



du £A = &u l 

dx v dy 
This equation defines the integrating factor L 



A y = —±A X = er L A x A y . (2.461) 



We now have that 



A x = e — , A y = e — , (2.463) 



and hence 

dx dy 



dll dU 

L3W = —±dx + —±dy = dU c . (2.464) 



2.16 Appendix II : Legendre Transformations 

A convex function of a single variable f(x) is one for which f"(x) > everywhere. The Legendre transform of a 
convex function f{x) is a function g(p) defined as follows. Let p be a real number, and consider the line y = px, 
as shown in Fig. 2.43. We define the point x(p) as the value of x for which the difference F(x,p) = px — f(x) is 
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y = f(x) 




y = px 



, y=px-g(p) = }'{x[p)) 



x 



Figure 2.43: Construction for the Legendre transformation of a function f(x). 



greatest. Then define g(p) = F(x(p),p)P The value x(p) is unique if f(x) is convex, since x(p) is determined by 
the equation 

f{x{p))=p. (2.465) 



Note that from p = f (x(p)) we have, according to the chain rule, 

d 



dp 



f'(x(p))=f"(x(p)) x'{p) 



x'(p)= f"{x(p)) 



(2.466) 



From this, we can prove that g(p) is itself convex: 



g'ip) = -j- Px(p) - f(x(p)) 

= px{p) + x{p) - f(x(p)) x'(p) = x(p) 



hence 



g"(p)=x'(p)= f"(x( P )) 



> 



In higher dimensions, the generalization of the definition f"(x) > is that a function F(x 1 , 
the matrix of second derivatives, called the Hessian, 



Hij(x) 



8 2 F 
dx i dxj 



(2.467) 

(2.468) 
is convex if 

(2.469) 



is positive definite. That is, all the eigenvalues of Hij (x) must be positive for every x. We then define the Legendre 
transform G(p) as 

G(p) = px F{x) (2.470) 

where 



p = VF 



(2.471) 



25 Note that g(p) may be a negative number, if the line y = px lies everywhere below f(x). 
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Note that 

dG = x ■ dp + p ■ dx - VF • dx = x ■ dp , (2.472) 
which establishes that G is a function of p and that 

Note also that the Legendre transformation is self dual, which is to say that the Legendre transform of G{p) is F(x): 
F — > G — » F under successive Legendre transformations. 

We can also define a partial Legendre transformation as follows. Consider a function of q variables F(x, y), where 
x = {x 1 ,..., x m } and y = {y 1 , y n }, with q = m + n. Define p = {p 1 , . . . ,p m }, and 

G(p,y)=px-F(x,y), (2.474) 

where 

P a = §f (a = l,...,m). (2.475) 



These equations are then to be inverted to yield 

9Pa 



x a = x a {p,y) = i- ■ (2.476) 



Note that 



Thus, from the chain rule, 



which says 



dF 

P a = ^{x{p,y),y) . (2.477) 

5 _dp a _ d 2 F dx c _ d^F d*G (247g) 

ab dp b dx a dx c dp b dx a dx c dp c dp b ' 

K:, 1 , (2.479) 



d Pa d Pb dp 

where the m x m partial Hessian is 



dx a dx b dx b 

Note that K ab = K ba is symmetric. And with respect to the y coordinates, 

d 2 G d 2 F 



d 2 F dp 

K b ■ (2-480) 



U , (2.481) 



dy^ dy v dy^ dy u ' 
where 

d 2 F 

L M „ = jr^r- (2.482) 
oy^ ay v 

is the partial Hessian in the y coordinates. Now it is easy to see that if the full 9x5 Hessian matrix is positive 
definite, then any submatrix such as K ab or must also be positive definite. In this case, the partial Legendre 
transform is convex in { Pl , . . . ,p m } and concave in {y l7 . . . , y n }. 
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2.17 Appendix III : Useful Mathematical Relations 



Consider a set of n independent variables {x 1 , . . . , x n }, which can be thought of as a point in n-dimensional space. 
Let {Vi, ■ ■ ■ ,y n } and {z l7 . . . , z n } be other choices of coordinates. Then 



dx{ dxi dyj 
dz k dy j dz k 



(2.483) 



Note that this entails a matrix multiplication: A ik = C- k , where A ik = dx i /dz k , = dx i /dyj, and C- fe 
dy j /dz k . We define the determinant 

( dxA _ d(x 1 ,...,x n ) 
\9zJ 



det 



'-k/ d(z 1 ,...,z n ) 

Such a determinant is called a Jacobean. Now if A = BC, then det (A) = det(i?) • det(C). Thus, 

d(x l ,...,x n ) d{x 1 ,...,x n ) d(y 1 ,...,y n ) 



9(zi,...,z n ) d(y 1 ,...,y n ) d(z u ...,z n ) 



Recall also that 



9xj 
dx u 



ik ' 



(2.484) 



(2.485) 



(2.486) 



Consider the case n = 2. We have 

d(x,y) 



/(9x) 
/ \duJv 



d(u, v) 



det 



'dx) 
v dv J u 



<:) r 
dtt 



We also have 



f dy\ ( dy_ 
\\du) v {OvJ q 

d(x,y) d(u,v) d(x,y) 



<9(it, v) d{r, s) d(r, s) 
From this simple mathematics follows several very useful results. 

1 ) First, write 



Now let y = v. 



Thus, 



2 ) Second, we have 



which is to say 



d{x,y) 
d(u, v) 

d(x,y) = 
d(u,y) 

dx 



d(u, v) 
d(x,y) 



dx 
du 



Idu 
y \d: 



— \ 

xhj 



dx 
dv 



dy 



d{x,y) _ 

d(u,y) \duj y d(x,u) d(u,y) 



dx\ d(x,y) d(x,u) 



9y\ ( dx\ 
du) x \dy) u 



dx\ ( dy\ ( dx 

dy)u \ 9u )x \ du 



(2.487) 



(2.488) 



(2.489) 



(2.490) 



(2.491) 



(2.492) 
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Invoking eqn. 2.491, we conclude that 



3) Third, we have 



which says 



(¥) =-!• (2-493) 



(2.494) 



d(x, v) d(x,v) d(y,v) 
d(u,v) d{y,v) d(u,v) ' 



dx\ f dx\ f dy 
du ) v ~ \ d v)v\ du 



(2.495) 



This is simply the chain rule of partial differentiation. 
4) Fourth, we have 

(dx,dy) (dx,dy) (du,dv) 



(du, dy) (du, dv) (du, dy) 

' dx\ ( dy\ ( dv\ I dx\ ( dy\ f dv 



duj v \dvj u \dyj u \dv J u \duJ v \dy 
which says 

' dx\ ( dx\ ( dx\ f dy 



(2.496) 



(2.497) 



5 ) Fifth, whenever we differentiate one extensive quantity with respect to another, holding only intensive quanti- 
ties constant, the result is simply the ratio of those extensive quantities. For example, 

The reason should be obvious. In the above example, S(p, V, T) = V<fr(p, T), where <p is a function of the two 
intensive quantities p and T. Hence differentiating S with respect to V holding p and T constant is the same as 
dividing S by V. Note that this implies 



(2.499) 



dv) p ^-\dv) p ^-\dv) ntT -V" 
where n = N/V is the particle density. 
6) Sixth, suppose we have a function E(y, v) and we write 

dE = xdy + udv . (2.500) 

That is, 



— ) =E u = (— 

dy ) v y \dv 



E y , u=( — )=E v . (2.501) 



Writing 



dx = E yy dy + E yv dv (2.502) 
du = E vy dy + E vv dv , (2.503) 
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and demanding du = yields 



Note that i? = £7 . From the equation du = we also derive 



VV 



(2.505) 

9 « /« E w 



Next, we use eqn. 2.503 with du = to eliminate dy in favor of dv, and then substitute into eqn. 2.502. This yields 

« u /u vy 



= E vv - %^ . (2.506) 



Finally eqn. 2.503 with dv = yields 

'dy\ 1 



<9u / 



(2.507) 



Combining the results of eqns. 2.504, 2.505, 2.506, and 2.507, we have 

d(x,y) f dx\ f dy\ ( dx\ ( dy s 



d(u,v) \duj v \dvj u \ dv J \ du 



KyJ\ KyJ \ Ky / \Ky , 



(2.508) 



Thus, 



Nota bene: It is important to understand what other quantities are kept constant, otherwise we can run into 
trouble. For example, it would seem that eqn. 2.508 would also yield 

WV) = 1 ■ (2 - 510) 

But then we should have 

d(n,N) d(p,V) d(ji,N) y 1 

when according to eqn. 2.508 it should be —1. What has gone wrong? 

The problem is that we have not properly specified what else is being held constant. In eqn. 2.509 it is N (or u) 
which is being held constant, while in eqn. 2.510 it is S (or T) which is being held constant. Therefore a naive 
application of the chain rule for determinants yields the wrong result, as we have seen. 

Let's be more careful. Applying the same derivation to dE = x dy+u dv + r ds and holding s constant, we conclude 

d(x,y,s) _ f dx\ ( dy\ ( dx\ ( dy\ (2 511) 



d(u,v,s) \duj v \dvj \dvj \du /v 
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Thus, the appropriate thermodynamic relations are 

d(T,S,N) _ d(T,S,») 
d(y,X,N) d(y,X,fi) 

d(T, S,X) d(T, S,y) 

dQi,N,X) d(ji,N,y) 

d(y,X,S) _ d(y,X,T) 
d{fx,N,S) d{fx,N,T) 

For example, if we add (//, N) to the mix, we should write 

d{T,S,N) _ d(- P ,V,S) _ dQi,N,V) 

d(-p,V,N) d{ft,N,S) d(T,S,V) 



(2.512) 



= -1 



(2.513) 



If we are careful, then the general result in eq. 2.512, where (y, X) = (— p, V) or (H a , M a ) or (E a , P a ), can be 
quite handy, especially when used in conjunction with eqn. 2.485. For example, we have 



dS\ d(T,S,N) _ d(T,S,N) d(p,V,N) = f dp\ 
dV) TN d{T,V,N) d(p,V,N)'d{T,V,N) \dT ) VN ' 

which is one of the Maxwell relations derived from the exactness of dF. Some other examples: 

(dV\ _ d(V,p,N) _ d(V,p,N) d(S,T,N) _ (dT\ 
\9SJp,N ~ d(S,p,N) ~ d(S,T,N) ' d(S, P ,N) ~ \dp~J SiN 

dS\ d{S,T,p) = d(S,T,p) d( P ,N,p) _(dv\ 
dNj Tp d(N,T,p) d(vL,N,p)' d(N,T,p) \dTJ P , N ' 



(2.514) 



(2.515) 
(2.516) 



Note that due to the alternating nature of the determinant - it is antisymmetric under interchange of any two rows 
or columns - we have 

d(x,y,z) = d(y,x,z) = d(y,x,z) (2 517) 

d(u,v,w) d(u,v,w) d(w,v,u) 

In general, it is usually advisable to eliminate S from a Jacobean. If we have a Jacobean involving T, S, and N, we 
can write 

d(T,S,N) d(T,S,N) d{p,V,N) _ d{p,V,N) 



d(»,m,N) d(p,V,N) d(»,»,N) d(»,»,N)' 
where each • is a distinct arbitrary state variable other than N. 
If our Jacobean involves the S, V, and N, we write 

d(S,V,N) _ d(S,V,N) d(T,V,N) _C V d(T,V,N) 



(2.518) 



d(.,*,N) d(T,V,N) d(.,.,N) T d(.,.,N) 



(2.519) 



If our Jacobean involves the S, p, and N, we write 



d(S,p,N) d(S,p,N) d(T,p,N) C p d(T,p,N) 



d(*,*,N) d(T,p,N) d(*,*,N) T 9(.,.,iV) 



(2.520) 
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For example, 

(dT\ _ d(T, S, N) _ d(T, S, N) d( P , V, N) djp, T, N) _ T_ /dV\ 
\9p)s,N ~ d(p,S,N) ~ d( P ,V,N) ' d(p,T,N) ' d(p,S,N) ~ C~ \&f ) pN 
fdV_\ _ d{V, S, N) _ diy, S, N) d(V, T, N) d(p,T,N) _ C v (dV\ 
\9p) S> n ~ d(p,S,N) ~ d(V,T,N) ' d(p,T,N) ' d(p, S,N) ~ ~C~ \ dp~ ) Tt 



Chapter 3 

Ergodicity and the Approach to 
Equilibrium 

3.1 References 

- R. Balescu, Equilibrium and Nonequilibrium Statistical Mechanics (Wiley, 1975) 
An advanced text with an emphasis on fluids and kinetics. 

- R. Balian, From Macrophysics to Microphysics (2 vols., Springer- Verlag, 2006) 

A very detailed discussion of the fundamental postulates of statistical mechanics and their implications.) 
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3.2 Modeling the Approach to Equilibrium 

3.2.1 Equilibrium 

A thermodynamic system typically consists of an enormously large number of constituent particles, a typical 
'large number' being Avogadro's number, = 6.02 x 10 23 . Nevertheless, in equilibrium, such a system is char- 
acterized by a relatively small number of thermodynamic state variables. Thus, while a complete description of a 
(classical) system would require us to account for O (10 23 ) evolving degrees of freedom, with respect to the phys- 
ical quantities in which we are interested, the details of the initial conditions are effectively forgotten over some 
microscopic time scale r, called the collision time, and over some microscopic distance scale, I, called the mean 
free path 1 . The equilibrium state is time-independent. 



3.2.2 The Master Equation 

Relaxation to equilibrium is often modeled with something called the master equation. Let P^t) be the probability 
that the system is in a quantum or classical state i at time t. Then write 

-i = H ( w ij p i w a p i) ■ (3-D 

3 

Here, is the rate at which j makes a transition to i. Note that we can write this equation as 

dt 

where 



r„- -T* 0.3) 

where the prime on the sum indicates that k = j is to be excluded. The constraints on the are that > for 
all and we may take W u = (no sum on i). Fermi's Golden Rule of quantum mechanics says that 

W ij = ^-\{i\V\j)\ 2 p{E j ), (3.4) 

where H Q | i ) = E i | i ), V is an additional potential which leads to transitions, and p(E i ) is the density of final 
states at energy E v The fact that W i} > means that if each P 4 (i = 0) > 0, then P^t) > for all t > 0. To see this, 
suppose that at some time t > one of the probabilities P L is crossing zero and about to become negative. But 
then eqn. 3.1 says that -Pj(t) = J2j W^Pfo) > 0. So P^t) can never become negative. 



3.2.3 Equilibrium distribution and detailed balance 



If the transition rates are themselves time-independent, then we may formally write 

P i (t) = {e- rt ) ij P j (0)- ( 3 - 5 ) 



1 Exceptions involve quantities which are conserved by collisions, such as overall particle number, momentum, and energy. These quantities 
relax to equilibrium in a special way called hydrodynamics. 
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Here we have used the Einstein 'summation convention' in which repeated indices are summed over (in this case, 
the j index). Note that 

i 

which says that the total probability ^\ P i is conserved: 

We conclude that <fi = (1, 1, . . . , 1) is a left eigenvector of T with eigenvalue A = 0. The corresponding right 
eigenvector, which we write as P^ q , satisfies ivpji = 0, and is a stationary (i.e. time independent) solution to the 
master equation. Generally, there is only one right /left eigenvector pair corresponding to A = 0, in which case 
any initial probability distribution P, (0) converges to P, eq as t — > oo, as shown in Appendix I (§3.7). 

In equilibrium, the net rate of transitions into a state | i ) is equal to the rate of transitions out of | i ). If, for each 
state | j ) the transition rate from \i) to \j) is equal to the transition rate from \j) to | i ), we say that the rates 
satisfy the condition of detailed balance. In other words, 

W ij Pp = W ji Pp. (3.8) 

Assuming =/= and Pj q =^ 0, we can divide to obtain 

W- P cq 

— 21 = (3 9) 

W pcq ■ V>.V) 

y t 

Note that detailed balance is a stronger condition than that required for a stationary solution to the master equa- 
tion. 

If r = r l is symmetric, then the right eigenvectors and left eigenvectors are transposes of each other, hence 
P cq = 1/N , where N is the dimension of r. The system then satisfies the conditions of detailed balance. See 
Appendix II (§3.8) for an example of this formalism applied to a model of radioactive decay. 

3.2.4 Boltzmann's H-theorem 

Suppose for the moment that r is a symmetric matrix, i.e. IV = IV. Then construct the function 

H (f )=£ P« (t) In P^t). (3.10) 



Then 



-r = ? —r (! + ln P, =y—r 



(3.11) 



dt ^ dt K u ^ dt 

i i 

id 

= Y,r iJ P ] (lnP J -lnP l ), 
where we have used £\ P^ = 0. Now switch i <-> j in the above sum and add the terms to get 

^ = \H r iA P i- P i) (lnP 4 -InP,) ■ (3.12) 
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Note that the i = j term does not contribute to the sum. For i ^ j we have P^ = —W^ < 0, and using the result 

(x-y) (lnz-lny) > , (3.13) 

we conclude 



dH r 
— < . 

dt ~ 

In equilibrium, P* q is a constant, independent of i. We write 



PI 



cq _ 



n 



n = E 1 



H = -lnft . 



(3.14) 



(3.15) 



If Fjj 7^ J"-,-, we can still prove a version of the H-theorem. Define a new symmetric matrix 



and the generalized H -function, 



Then 



dH 
~dt 



In 



< 



(3.16) 



(3.17) 



(3.18) 



3.3 Phase Flows in Classical Mechanics 



3.3.1 Hamiltonian evolution 



The master equation provides us with a semi-phenomenological description of a dynamical system's relaxation to 
equilibrium. It explicitly breaks time reversal symmetry. Yet the microscopic laws of Nature are (approximately) 
time-reversal symmetric. How can a system which obeys Hamilton's equations of motion come to equilibrium? 

Let's start our investigation by reviewing the basics of Hamiltonian dynamics. Recall the Lagrangian L = L(q,q,t) = 
T — V . The Euler-Lagrange equations of motion for the action S \q(t)] = Jdt L are 



. _ d ( dL\ _ dL 
V ° Jt\WJ W a ' 

where p a is the canonical momentum conjugate to the generalized coordinate q a : 

dL 



Pa 



dq a 



(3.19) 



(3.20) 



The Hamiltonian, H(q,p) is obtained by a Legendre transformation, 

r 

H(q,p) = XX ^ ~ L ■ 



(3.21) 
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Note that 



,rr / ,. • , 9i , dL ,. \ dL , 



<T=1 

Thus, we obtain Hamilton's equations of motion, 



dq a ^ dq a ^) dt 

at 



dH 


= % 


dH 


dL 




dq a 


dq„ 




dH 


dH 


dL 




~dt 


dt 


dt ' 



Define the rank 2r vector cp by its components, 



,Pi- r Hr < i <2r 
Then we may write Hamilton's equations compactly as 

dH 



dipj 

where 



(3.22) 



-Pa (3-23) 

upa uq a uq a 

and 



(3.24) 



q i if 1 < i < r 

<Pi = < (3-25) 



<Pi = J ij ~ETT i (3.26) 



J = f ° rxr lrxr j (3.27) 
is a rank 2r matrix. Note that J' = — J, i.e. J is antisymmetric, and that J 2 = -l 2r x2r- 

3.3.2 Dynamical systems and the evolution of phase space volumes 

Consider a general dynamical system, 

^ = , (3-28) 

where tp(t) is a point in an n-dimensional phase space. Consider now a compact 2 region 1Z in phase space, and 
consider its evolution under the dynamics. That is, 1Z consists of a set of points {<p \ (f € Hq}, and if we regard 
each ip G TZ Q as an initial condition, we can define the time-dependent set lZ(t) as the set of points cp(t) that were 
in TZ at time t = 0: 

K(t) = { v (t) | tp(0) € K } . (3.29) 
Now consider the volume fi(t) of the set 7£(i). We have 

Cl(t) = Jdfi (3.30) 

K(t) 



'Compact' in the parlance of mathematical analysis means 'closed and bounded'. 
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where 

dn = dip 1 dip 2 ■■■ dip n , 
for an n-dimensional phase space. We then have 

LfJL 

K{t+dt) K{t) 

where 



fi(f + dt)= J dfi' = J dp 



dip^t + dt) 



dcpJt) 



dp jit) 



<<P* 



and therefore 



<Pi(t + dt) = Vi (t) + V t (<p(t)) dt + 0(dt 2 ) 
d ^ {t + dt) =S, n + ^dt + 0(dt*). 



We now make use of the equality 
for any matrix M , which gives us 3 , for small e, 



IndetM = Tr InM 



Thus, 



which says 



det(l + eA) =cx P Tr \n(l + eA) = 1 + e Tr A + \ e 2 [{ Tr A) - Tr (A 2 ) 
Q(t + dt) = fl(t) + JdfiV-V dt + 0{dt 2 ) , 



dQ 
~dt 



n(t) 

duV-V = JdSh-V 

n{t) dn(t) 



Here, the divergence is the phase space divergence, 



(3.31) 



(3.32) 



(3.33) 



is a determinant, which is the Jacobean of the transformation from the set of coordinates Up i = to the 

coordinates = (p^t + dt)}. But according to the dynamics, we have 



(3.34) 
(3.35) 
(3.36) 

(3.37) 
(3.38) 

(3.39) 



(3.40) 



and we have used the divergence theorem to convert the volume integral of the divergence to a surface integral 
of n ■ V, where n is the surface normal and dS is the differential element of surface area, and dlZ denotes the 
boundary of the region 1Z. We see that if V- V = everywhere in phase space, then fi(t) is a constant, and phase 
space volumes are preserved by the evolution of the system. 

For an alternative derivation, consider a function g(ip,t) which is defined to be the density of some collection of 
points in phase space at phase space position ip and time t. This must satisfy the continuity equation, 



^ + V-( C V) = 



(3.41) 



3 The equality lndet M = Tr InM is most easily proven by bringing the matrix to diagonal form via a similarity transformation, and 
proving the equality for diagonal matrices. 
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t = u 





t = u 



t = u 



Figure 3.1: Time evolution of two immiscible fluids. The local density remains constant. 



This is called the continuity equation. It says that 'nobody gets lost'. If we integrate it over a region of phase space 
1Z, we have 

j t JdiiQ=- Jd^V-(gV) = - JdS h- {gV) . (3.42) 

K K dTZ 

It is perhaps helpful to think of g as a charge density, in which case J = gV is the current density. The above 
equation then says 



dt 



dS n ■ J , 



(3.43) 



cm 



where Q n is the total charge contained inside the region 1Z. In other words, the rate of increase or decrease of the 
charge within the region 7Z is equal to the total integrated current flowing in or out of 7Z at its boundary. 



The Leibniz rule lets us write the continuity equation as 

dg 



dt 



V-Vg+gV-V = . 



But now suppose that the phase flow is divergenceless, i.e. V- V = 0. Then we have 



(3.44) 



(3.45) 



The combination inside the brackets above is known as the convective derivative. It tells us the total rate of change 
of q for an observer moving with the phase flow. That is 



= V v. dp i dg = Dg 
^ 1 deft dt Dt 



(3.46) 
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If Dg/Dt = 0, the local density remains the same during the evolution of the system. If we consider the 'charac- 
teristic function' 

^,* = 0)=(; ***** (3.47) 
I otherwise 

then the vanishing of the convective derivative means that the image of the set 7Z under time evolution will 
always have the same volume. 

Hamiltonian evolution in classical mechanics is volume preserving. The equations of motion are 

9H . dH 

cn = +^- , Pi=-JT (3-48) 

A point in phase space is specified by r positions q i and r momenta p it hence the dimension of phase space is 

n = 2r: 

Hamilton's equations of motion guarantee that the phase space flow is divergenceless: 

dpi 
dpi 



(—) 


d t 


8H\ 


\dpr) 


dpt \ 


dqi] 



(3.50) 



r 

= E 

i=l 

Thus, we have that the convective derivative vanishes, viz. 

g-| + V.^.O. (3,1) 

for any distribution g{(p, t) on phase space. Thus, the value of the density g(<p(t), t) is constant, which tells us that 
the phase flow is incompressible. In particular, phase space volumes are preserved. 



3.3.3 Liouville's equation and the microcanonical distribution 

Let g(tp) = g(q, p) be a distribution on phase space. Assuming the evolution is Hamiltonian, we can write 

dg . __ >A / . d . d \ t 

- = -^.V Q = -^[ qk — + Pk —J e = - lLe , (3.52) 
where L is a differential operator known as the Liouvillian: 

L- -i'S" {— — - — — \ (3 53) 

^ • dp k dq k dq k dp k l ' 



k=l 



Eqn. 3.52, known as Liouville's equation, bears an obvious resemblance to the Schrodinger equation from quantum 
mechanics. 

Suppose that A a (ip) is conserved by the dynamics of the system. Typical conserved quantities include the com- 
ponents of the total linear momentum (if there is translational invariance), the components of the total angular 
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momentum (if there is rotational invariance), and the Hamiltonian itself (if the Lagrangian is not explicitly time- 
dependent). Now consider a distribution g((p, t) = g(A 1 , A 2 , ■ ■ ■ , A k ) which is a function only of these various 
conserved quantities. Then from the chain rule, we have 

* ■ Vg = E |f * ' VAa = ' (3 - 54) 

a a 

since for each a we have 

dA n sr^ ( dA n . dA n , \ . „ , ^ 

We conclude that any distribution g(tp, t) = g(A 1 ,A 2 , . . . , A k ) which is a function solely of conserved dynamical 
quantities is a stationary solution to Liouville's equation. 

Clearly the microcanonical distribution, 

6{E-H(<p)) 5{E-H(<p)) 
gM = D(E) = J dfi S(E-H{ V )) ' (3 ' 56) 

is a fixed point solution of Liouville's equation. 



3.4 Irreversibility and Poincare Recurrence 

The dynamics of the master equation describe an approach to equilibrium. These dynamics are irreversible: 
dY\/dt < 0, where H is Boltzmann's H-function. However, the microscopic laws of physics are (almost) time- 
reversal invariant 4 , so how can we understand the emergence of irreversibility? Furthermore, any dynamics 
which are deterministic and volume-preserving in a finite phase space exhibits the phenomenon of Poincare recur- 
rence, which guarantees that phase space trajectories are arbitrarily close to periodic if one waits long enough. 



3.4.1 Poincare recurrence theorem 

The proof of the recurrence theorem is simple. Let g T be the 'r-advance mapping' which evolves points in phase 
space according to Hamilton's equations. Assume that g T is invertible and volume-preserving, as is the case for 
Hamiltonian flow. Further assume that phase space volume is finite. Since energy is preserved in the case of 
time-independent Hamiltonians, we simply ask that the volume of phase space at fixed total energy E be finite, i.e. 

I dfi 6 (E - H(q,p)) <oo , (3.57) 

where d^i = dq dp is the phase space uniform integration measure. 

Theorem: In any finite neighborhood 1Z of phase space there exists a point cp which will return to 7Z after m 
applications of g T , where m is finite. 

Proof: Assume the theorem fails; we will show this assumption results in a contradiction. Consider the set T 
formed from the union of all sets 7Z for all m: 

oo 

T=\Jg k T K (3.58) 

4 Actually, the microscopic laws of physics are not time-reversal invariant, but rather are invariant under the product PCT, where P is 
parity, C is charge conjugation, and T is time reversal. 
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Figure 3.2: Successive images of a set TZo under the r-advance mapping g T , projected onto a two-dimensional 
phase plane. The Poincare recurrence theorem guarantees that if phase space has finite volume, and g T is invertible 
and volume preserving, then for any set TZo there exists an integer m such that TZo H g™ TZo ^ 0- 



We assume that the set {g^TZ \k € N} is disjoint 5 . The volume of a union of disjoint sets is the sum of the 
individual volumes. Thus, 

oo 

vol(T) = 5>ol( ffT fc ^ ) 

(3.59) 

= voi c^o) • 1 = 00 > 

fe=0 

since vol(g^ TZ ) = vol(TZ ) from volume preservation. But clearly T is a subset of the entire phase space, hence 
we have a contradiction, because by assumption phase space is of finite volume. 

Thus, the assumption that the set {g* TZ \ k £ Z + } is disjoint fails. This means that there exists some pair of integers 
k and /, with k ^ I, such that g k TZ Q D g l T TZ Q ^ 0. Without loss of generality we may assume k < I. Apply the 
inverse g^ 1 to this relation k times to get g l ~ k TZ n TZ ^ 0. Now choose any point ip 1 e g™ TZ n TZ , where 
m = I — k, and define <p = g~ m ip 1 . Then by construction both (p and <p lie within TZ and the theorem is 
proven. 

Poincare recurrence has remarkable implications. Consider a bottle of perfume which is opened in an otherwise 
evacuated room, as depicted in fig. 3.3. The perfume molecules evolve according to Hamiltonian evolution. 
The positions are bounded because physical space is finite. The momenta are bounded because the total energy is 
conserved, hence no single particle can have a momentum such that T(p) > E TOT , where T{p) is the single particle 
kinetic energy function 6 . Thus, phase space, however large, is still bounded. Hamiltonian evolution, as we have 
seen, is invertible and volume preserving, therefore the system is recurrent. All the molecules must eventually 
return to the bottle. What's more, they all must return with momenta arbitrarily close to their initial momenta! 7 

5 The natural numbers N is the set of non-negative integers {0, 1,2,...}. 

6 In the nonrelativistic limit, T = p 2 /2m. For relativistic particles, we have T = (p 2 c 2 + rn 2 c 4 ) 1/l2 — mc 2 . 

7 Actually, what the recurrence theorem guarantees is that there is a configuration arbitrarily close to the initial one which recurs, to within 
the same degree of closeness. 
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t = t = Is £ >> H 

Figure 3.3: Poincare recurrence guarantees that if we remove the cap from a bottle of perfume in an otherwise 
evacuated room, all the perfume molecules will eventually return to the bottle! (Here H is the Hubble constant.) 

In this case, we could define the region 1Z as 

K = {{q 1 ,...,q r ,p 1 ,..., Pr ) | \q z -q°\ < Ag and ^ < ApVi,j} , (3.60) 
which specifies a hypercube in phase space centered about the point (q°,p°). 

Each of the three central assumptions - finite phase space, invertibility, and volume preservation - is crucial. If 
any one of these assumptions does not hold, the proof fails. Obviously if phase space is infinite the flow needn't 
be recurrent since it can keep moving off in a particular direction. Consider next a volume-preserving map which 
is not invertible. An example might be a mapping / : R — > R which takes any real number to its fractional part. 
Thus, /(tt) = 0.14159265 . . .. Let us restrict our attention to intervals of width less than unity. Clearly / is then 
volume preserving. The action of / on the interval [2, 3) is to map it to the interval [0, 1). But [0, 1) remains fixed 
under the action of /, so no point within the interval [2, 3) will ever return under repeated iterations of /. Thus, / 
does not exhibit Poincare recurrence. 

Consider next the case of the damped harmonic oscillator. In this case, phase space volumes contract. For a one- 
dimensional oscillator obeying x + 2(3x + S7q x = one has V- V = —2/3 < 0, since j3 > for physical damping. 
Thus the convective derivative is D t g = —(W-V)g = 2{3g which says that the density increases exponentially in 
the comoving frame, as g(t) = e 2f3t g(0). Thus, phase space volumes collapse: 0(i) = e~ 2/32 0(0), and are not 
preserved by the dynamics. The proof of recurrence therefore fails. In this case, it is possible for the set T to be of 
finite volume, even if it is the union of an infinite number of sets 7Z , because the volumes of these component 
sets themselves decrease exponentially, as vol(g™ TZ ) = e~ 2n P T vol(TZ ). A damped pendulum, released from rest 
at some small angle 6 , will not return arbitrarily close to these initial conditions. 

3.4.2 Kac ring model 

The implications of the Poincare recurrence theorem are surprising - even shocking. If one takes a bottle of 
perfume in a sealed, evacuated room and opens it, the perfume molecules will diffuse throughout the room. The 
recurrence theorem guarantees that after some finite time T all the molecules will go back inside the bottle (and 
arbitrarily close to their initial velocities as well). The hitch is that this could take a very long time, e.g. much much 
longer than the age of the Universe. 

On less absurd time scales, we know that most systems come to thermodynamic equilibrium. But how can a 
system both exhibit equilibration and Poincare recurrence? The two concepts seem utterly incompatible! 
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t = n t=n+l 

Figure 3.4: Left: A configuration of the Kac ring with iV = 16 sites and F = 4 flippers. The flippers, which live 
on the links, are represented by blue dots. Right: The ring system after one time step. Evolution proceeds by 
clockwise rotation. Spins passing through flippers are flipped. 



A beautifully simple model due to Kac shows how a recurrent system can exhibit the phenomenon of equilibra- 
tion. Consider a ring with TV sites. On each site, place a 'spin' which can be in one of two states: up or down. 
Along the TV links of the system, F of them contain 'flippers'. The configuration of the flippers is set at the outset 
and never changes. The dynamics of the system are as follows: during each time step, every spin moves clockwise 
a distance of one lattice spacing. Spins which pass through flippers reverse their orientation: up becomes down, 
and down becomes up. 

The 'phase space' for this system consists of 2 N discrete configurations. Since each configuration maps onto a 
unique image under the evolution of the system, phase space 'volume' is preserved. The evolution is invertible; 
the inverse is obtained simply by rotating the spins counterclockwise. Figure 3.4 depicts an example configuration 
for the system, and its first iteration under the dynamics. 

Suppose the flippers were not fixed, but moved about randomly. In this case, we could focus on a single spin and 
determine its configuration probabilistically. Let p n be the probability that a given spin is in the up configuration 
at time n. The probability that it is up at time (n + 1) is then 

p n+ l = (1 - x)p n + x (1 - p n ) , (3.61) 

where x = F/N is the fraction of flippers in the system. In words: a spin will be up at time (n + 1) if it was up at 
time n and did not pass through a flipper, or if it was down at time n and did pass through a flipper. If the flipper 
locations are randomized at each time step, then the probability of flipping is simply x = F/N. Equation 3.61 can 
be solved immediately: 

Pn = | + (1 - 2*)" (po - |) , (3.62) 
which decays exponentially to the equilibrium value of p cq = \ with time scale 

^=~w^v (3 - 63) 

We identify t(x) as the microscopic relaxation time over which local equilibrium is established. If we define the 
magnetization m = (N^ — N,)/N, then m = 2p — 1, so m n = (1 — 2x) n mo. The equilibrium magnetization is 
m eq = 0. Note that for | < x < 1 that the magnetization reverses sign each time step, as well as decreasing 
exponentially in magnitude. 
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Figure 3.5: Three simulations of the Kac ring model with N = 2500 sites and three different concentrations of 
flippers. The red line shows the magnetization as a function of time, starting from an initial configuration in which 
100% of the spins are up. The blue line shows the prediction of the Stosszahlansatz, which yields an exponentially 
decaying magnetization with time constant r. 



The assumption that leads to equation 3.61 is called the Stosszahlansatz 8 , a long German word meaning, approx- 
imately, 'assumption on the counting of hits'. The resulting dynamics are irreversible: the magnetization inex- 
orably decays to zero. However, the Kac ring model is purely deterministic, and the Stosszahlansatz can at best 
be an approximation to the true dynamics. Clearly the Stosszahlansatz fails to account for correlations such as the 
following: if spin i is flipped at time n, then spin i + 1 will have been flipped at time n—l. Also if spin i is flipped 
at time n, then it also will be flipped at time n + N. Indeed, since the dynamics of the Kac ring model are invertible 
and volume preserving, it must exhibit Poincare recurrence. We see this most vividly in figs. 3.5 and 3.6. 

The model is trivial to simulate. The results of such a simulation are shown in figure 3.5 for a ring of N = 1000 
sites, with F = 100 and F = 24 flippers. Note how the magnetization decays and fluctuates about the equilibrium 
value m eq = 0, but that after N iterations m recovers its initial value: m N — m . The recurrence time for this 
system is simply N if F is even, and 2N if F is odd, since every spin will then have flipped an even number of 



8 Unfortunately, many important physicists were German and we have to put up with a legacy of long German words like Gedankenexperi- 
ment, Zitterbeivegimg, Brehmsstrahhmg, Stosszahlansatz, Kartojfelsalat, etc. 
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Figure 3.6: Simulations of the Kac ring model. Top: TV = 2500 sites with F = 201 flippers. After 2500 iterations, 
each spin has flipped an odd number of times, so the recurrence time is 2N. Middle: N = 2500 with F = 2400, 
resulting in a near-complete reversal of the population with every iteration. Bottom: N = 25000 with N = 1000, 
showing long time equilibration and dramatic resurgence of the spin population. 



times. 



In figure 3.6 we plot two other simulations. The top panel shows what happens when x > \, so that the magneti- 
zation wants to reverse its sign with every iteration. The bottom panel shows a simulation for a larger ring, with 
N = 25000 sites. Note that the fluctuations in m about equilibrium are smaller than in the cases with N = 1000 
sites. Why? 



3.5 Remarks on Ergodic Theory 
3.5.1 Definition of ergodicity 

A mechanical system evolves according to Hamilton's equations of motion. We have seen how such a system is 
recurrent in the sense of Poincare. 
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There is a level beyond recurrence called ergodicity. In an ergodic system, time averages over intervals [0, T] with 
T — ► oo may be replaced by phase space averages. The time average of a function f{f) is defined as 



T 

1 



(f(<p)) T = T lim - Jdtf{tp(t)) . (3.64) 
o 

For a Hamiltonian system, the phase space average of the same function is defined by 

(f(<p)) s = jdpf{v)8{E-H{ip)) I Jd[i6(E-H(<p)) , (3.65) 

where H(ip) = H(q, p) is the Hamiltonian, and where S(x) is the Dirac (5-function. Thus, 

ergodicity (f(<p)) T = (f(<p)) s , (3-66) 

for all smooth functions f(<p) for which {f(f)) s exists and is finite. Note that we do not average over all of phase 
space. Rather, we average only over a hypersurface along which H(tp) = E is fixed, i.e. over one of the level 
sets of the Hamiltonian function. This is because the dynamics preserves the energy. Ergodicity means that almost 
all points tp will, upon Hamiltonian evolution, move in such a way as to eventually pass through every finite 
neighborhood on the energy surface, and will spend equal time in equal regions of phase space. 

Let X n (ip) be the characteristic function of a region 7Z: 



otherwise, 



where H (<p) = E for all if e 1Z. Then 



If the system is ergodic, then 



/ / n\ /time spent in 1Z \ ,„ 
(Xn(<P)) T = r l™ ( ^ J • (3-68) 

<X K M) T = P(^) = ^^, (3.69) 

where P(TZ) is the a priori probability to find (p E 1Z, based solely on the relative volumes of 7Z and of the entire 
phase space. The latter is given by 

D(E)= fdn8(E-H(<p)) , (3.70) 
called the density of states, is the surface area of phase space at energy E, and 

D n {E) = Jdp5(E-H( V )) . (3.71) 
n 

is the density of states for the phase space subset TZ. Note that 

D(E)^JdpS(E- H{<p)) = (3-72) 
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Figure 3.7: Constant phase space velocity at an irrational angle over a toroidal phase space is ergodic, but not 
mixing. A circle remains a circle, and a blob remains a blob. 



Here, dS is the differential surface element, S E is the constant H hypersurface H(ip) = E, and VL{E) is the volume 
of phase space over which H (ip) < E. Note also that we may write 



where 



is the the invariant surface element. 



dS 



dp, = dE dS E 
dS 



Vffl 



(3.74) 
(3.75) 



H(ip)=E 



3.5.2 The microcanonical ensemble 

The distribution, 



S(E-H( V )) 6(E-H(<p)) 
Qe{<P) = D(E) = f dli 6(E-H( V )) ' (376) 



defines the microcanonical ensemble (^CE) of Gibbs. 
We could also write 



integrating over the hypersurface S E rather than the entire phase space. 



3.5.3 Ergodicity and mixing 

Just because a system is ergodic, it doesn't necessarily mean that g(ip,t) — ► g cq ((p), for consider the following 
motion on the toroidal space {tp = (q,p) \ < q < 1 , < p < l}, where we identify opposite edges, i.e. we 
impose periodic boundary conditions. We also take q and p to be dimensionless, for simplicity of notation. Let the 
dynamics be given by 

q = 1 , p = a. (3.78) 

The solution is 

q(t)=q +t , p(t)=p + at, (3.79) 



hence the phase curves are given by 



P = P + a (l - %) 



(3.80) 
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Figure 3.8: The baker's transformation is a successive stretching, cutting, and restacking. 

Now consider the average of some function f(q,p). We can write f(q,p) in terms of its Fourier transform, 

f(q,p) = J2frn n e 2ni{mq+np) ■ (3-81) 

We have, then, 

f(q(t), P (t)) = J2 Iran e^ m ^ n ^) e 2*i(m+an)t _ (3 _g 2) 



We can now perform the time average of /: 



(f{q,p)) T = Zoo + lirn ^ ^ f mn 



2-ni(mq +np a ) 



T ^°° T ^ " 2m(m + an) (3 g 3) 

= / 00 if a irrational. 

Clearly, 

l l 

(Mp)) s = [dq fdpf(q,p) = f 00 = {f(q, P )) T , (3.84) 







so the system is ergodic. 

The situation is depicted in fig. 3.7. If we start with the characteristic function of a disc, 

g(q, p, t = 0) = 6 (a 2 - (q - q Q ) 2 - (p - p ) 2 ) , (3.85) 
then it remains the characteristic function of a disc: 

g(q, p,t) = Q (a 2 - {q - q - t) 2 - (p - p - at) 2 ) , (3.86) 

A stronger condition one could impose is the following. Let A and B be subsets of S E . Define the measure 

v{A) = JdS E X A (<P) j ' jd£ E = , (3.87) 
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n =5 n = 3 



Figure 3.9: The multiply iterated baker's transformation. The set A covers half the phase space and its area is 
preserved under the map. Initially, the fraction of B covered by A is zero. After many iterations, the fraction of B 
covered by g n A approaches \. 



where X A (ip) is the characteristic function of A. The measure of a set A is the fraction of the energy surface S E 
covered by A. This means v{S E ) = 1, since S E is the entire phase space at energy E. Now let g be a volume- 
preserving map on phase space. Given two measurable sets A and B, we say that a system is mixing if 



mixing 



lim u(g n A n b) = v(A) v(B) . 

n — »oo \ / 



(3.88) 



In other words, the fraction of B covered by the n th iterate of A, i.e. g n A, is, as n — * oo, simply the fraction of S E 
covered by A. The iterated map g n distorts the region A so severely that it eventually spreads out 'evenly' over 
the entire energy hypersurface. Of course by 'evenly' we mean 'with respect to any finite length scale', because at 
the very smallest scales, the phase space density is still locally constant as one evolves with the dynamics. 



Mixing means that 



</(¥>)> 



d(J-g(ip,t)f((p) 

dtif(<p)6(E-H(<p)) I Jdfx8(E-H(<p)) 



(3.89) 



= Tr 



f(cp)5(E-H(<p)) Tr 6{E - H(<p)) 



Physically, we can imagine regions of phase space being successively stretched and folded. During the stretching 
process, the volume is preserved, so the successive stretch and fold operations map phase space back onto itself. 

An example of a mixing system is the baker's transformation, depicted in fig. 3.8. The baker map is defined by 

(2g,ip) if 0<g<i 



(3.90) 



( 2g -l,I p+ I) if !<<?<!. 
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original 1 3 132 155 157 



200 211 240 275 299 300 

Figure 3.10: The Arnold cat map applied to an image of 150 x 150 pixels. After 300 iterations, the image repeats 
itself. (Source: Wikipedia) 



Note that g is invertible and volume-preserving. The baker's transformation consists of an initial stretch in which q 
is expanded by a factor of two and p is contracted by a factor of two, which preserves the total volume. The system 
is then mapped back onto the original area by cutting and restacking, which we can call a 'fold'. The inverse 
transformation is accomplished by stretching first in the vertical (p) direction and squashing in the horizontal (q) 
direction, followed by a slicing and restacking. Explicitly, 

\{\q,2p) if 0< P <I 

9-\q,p) = I (3-91) 
{{\q+\,2p-l) if \<P<1. 



Another example of a mixing system is Arnold's 'cat map' 9 

g(q,p) = ([q + p], [q + 2p]) , (3.92) 
where [x] denotes the fractional part of x. One can write this in matrix form as 

M 

The matrix M is very special because it has integer entries and its determinant is dct M = 1. This means that the 
inverse also has integer entries. The inverse transformation is then 



mod Z 2 . (3.94) 



Now for something cool. Suppose that our image consists of a set of discrete points located at (n 1 /k , n 2 /k), 
where the denominator k G Z is fixed, and where n 1 and n 2 range over the set {1, . . . , k}. Clearly g and its inverse 

'The cat map gets its name from its initial application, by Arnold, to the image of a cat's face. 
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Figure 3.11: The hierarchy of dynamical systems. 



preserve this set, since the entries of M and M 1 are integers. If there are two possibilities for each pixel (say off 
and on, or black and white), then there are 2^"' possible images, and the cat map will map us invertibly from 
one image to another. Therefore it must exhibit Poincare recurrence! This phenomenon is demonstrated vividly 
in fig. 3.10, which shows a k = 150 pixel (square) image of a cat subjected to the iterated cat map. The image is 
stretched and folded with each successive application of the cat map, but after 300 iterations the image is restored! 
How can this be if the cat map is mixing? The point is that only the discrete set of points {n 1 /k , n 2 /k) is periodic. 
Points with different denominators will exhibit a different periodicity, and points with irrational coordinates will 
in general never return to their exact initial conditions, although recurrence says they will come arbitrarily close, 
given enough iterations. The baker's transformation is also different in this respect, since the denominator of the 
p coordinate is doubled upon each successive iteration. 

The student should now contemplate the hierarchy of dynamical systems depicted in fig. 3.11, understanding the 
characteristic features of each successive refinement 10 . 



3.6 Thermalization of Quantum Systems 
3.6.1 Quantum dephasing 

Thermalization of quantum systems is fundamentally different from that of classical systems. Whereas time evolu- 
tion in classical mechanics is in general a nonlinear dynamical system, the Schrodinger equation for time evolution 
in quantum mechanics is linear: 

a* 

ih— = , (3.95) 

where H is a many-body Hamiltonian. In classical mechanics, the thermal state is constructed by time evolution 
- this is the content of the ergodic theorem. In quantum mechanics, as we shall see, the thermal distribution must 
be encoded in the eigenstates themselves. 

Let us assume an initial condition at t = 0, 

|*(0)> = X;CJ* a >, (3.96) 

a 

10 There is something beyond mixing, called a K -system. A if-system has positive Kolmogorov-Sinai entropy. For such a system, closed 
orbits separate exponentially in time, and consequently the Liouvillian L has a Lebesgue spectrum with denumerably infinite multiplicity. 
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where { | $ a ) } is an orthonormal eigenbasis for H satisfying H |\P a ) = E a \^ a )The expansion coefficients satisfy 
c a = (*cl*(0)) and J2 a \ c a\ 2 = !• Normalization requires 

(*(0)|*(0)H^|CJ 2 = 1. (3.97) 

a 

The time evolution of |^) is then given by 

\*(t))=Y / C a e- lE " t/h \* a ) . (3.98) 

a 

The energy is distributed according to the time-independent function 

P(E) = < | S(E H) | ) = \C a ? 5{E - E a ) . (3.99) 

Ot 

Thus, the average energy is time-independent and is given by 

oo 

(E) = (*(t)\H\V(t))= jdEP{E)E = Y^\C a \ 2 E a . (3.100) 

-oo 

The root mean square fluctuations of the energy are given by 



(AE) rms = ((E (E)f) 1/2 = J2 \ C ^ E l {j2\ C a \ 2E a ) 2 ■ (3-101) 

y a a 

Typically we assume that the distribution P(E) is narrowly peaked about (E), such that (AE) lms <C E — E , 
where E Q is the ground state energy. Note that P(E) = for E < E , i.e. the eigenspectrum of H is bounded from 
below. 

Now consider a general quantum observable described by an operator A. We have 

(A(t)) = ( *(t) | A | *(*) )=J2 c :°P e^-" W A a0 , (3.102) 

where A a g = (^ a \A\^f g ). In the limit of large times, we have 

T 

(A) T = lira i fdt (A(t)) = £ \C a | 2 A aa . (3.103) 

1 — 'OO 1 I * * 

o 

Note that this implies that all coherence between different eigenstates is lost in the long time limit, due to dephasing. 
3.6.2 Eigenstate thermalization hypothesis 

The essential ideas behind the eigenstate thermalization hypothesis (ETH) were described independently by J. Deutsch 
(1991) and by M. Srednicki (1994). The argument goes as follows. If the total energy is the only conserved quan- 
tity, and if A is a local, translationally-invariant, few-body operator, then the time average (A) is given by its 
microcanonical value, 

<A)t = £ \CjA aa = ^y^j)^ = <A)r, (3-104) 
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where I = [E, E + AE~\ is an energy interval of width AE. So once again, time averages are micro canonical 
averages. 

But how is it that this is the case? The hypothesis of Deutsch and of Srednicki is that thermalization in isolated and 
bounded quantum systems occurs at the level of individual eigenstates. That is, for all eigenstates |* Q ) with E a e I, 
one has 

A aa = (A) Ea . (3.105) 

This means that thermal information is encoded in each eigenstate. This is called the eigenstate thermalization hypothesis 
(ETH). 

An equivalent version of the ETH is the following scenario. Suppose we have an infinite or extremely large 
quantum system U (the 'universe') fixed in an eigenstate \^ a ). Then form the projection operator P a = \^ a )(^ a \- 
Projection operators satisfy P 2 = P and their eigenspectrum consists of one eigenvalue 1 and the rest of the 
eigenvalues are zero 11 . Now consider a partition of U = W U S, where W ^> S. We imagine S to be the 'system' 
and W the 'world'. We can always decompose the state \& a ) in a complete product basis for W and S, viz. 

i*-> = E E Qpj K) ® wf) • ( 3 - 106 ) 

p=i j=i 

Here N w , s is the size of the basis for W/S. The reduced density matrix for S is defined as 

N s / N w \ 

The claim is that p s is a thermal density matrix on S, i.e. 

p s = ±e~^, (3.108) 

where H s is the Hamiltonian restricted to S, and Z s = Tr e~" H s, so that Tr p s = 1 and p s is properly normalized. 
Here (3 = l/k B T with T the temperature. The temperature is fixed by the requirement that Tr(p s H s ) = E a ■ 
(Ys /^u)' where the last factor is a ratio of volumes. 



3.6.3 When is the ETH true? 

There is no rigorous proof of the ETH. Deutsch showed that the ETH holds for the case of an integrable Hamilto- 
nian weakly perturbed by a single Gaussian random matrix. Horoi et at (1995) showed that nuclear shell model 
wavefunctions reproduce thermodynamic predictions. Recent numerical work by M. Rigol and collaborators has 
verified the applicability of the ETH in small interacting boson systems. ETH fails for so-called integrable models, 
where there are a large number of conserved quantities, which commute with the Hamiltonian. Integrable models 
are, however, quite special, and as Deutsch showed, integrability is spoiled by weak perturbations, in which case 
ETH then applies. 

Note again that in contrast to the classical case, time evolution of a quantum state does not create the thermal state. 
Rather, it reveals the thermal distribution which is encoded in all eigenstates after sufficient time for dephasing to 
occur, so that correlations between all the wavef unction expansion coefficients {C a } for a^a' are all lost. 

11 More generally, we could project onto a _ft"-dimensional subspace, in which case there would be K eigenvalues of +1 and N — K eigen- 
values of 0, where N is the dimension of the entire vector space. 
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3.7 Appendix I : Formal Solution of the Master Equation 

Recall the master equation P i = — Jy P-. The matrix is real but not necessarily symmetric. For such a matrix, 
the left eigenvectors cj)f and the right eigenvectors ipj are not the same: general different: 

r /? x /J (3109) 

Note that the eigenvalue equation for the right eigenvectors is Tip = Xtp while that for the left eigenvectors is 
.TV = \<j). The characteristic polynomial is the same in both cases: 

F{\) = dct (A- r) = dot (A- r*) , (3.110) 

which means that the left and right eigenvalues are the same. Note also that [-F(A)] * = F(\*), hence the eigenval- 
ues are either real or appear in complex conjugate pairs. Multiplying the eigenvector equation for <j> a on the right 
by i/jj and summing over j, and multiplying the eigenvector equation for ip@ on the left by cjjf and summing over 
i, and subtracting the two results yields 

(\ a -\ p )(r\^) = o, (3.1H) 

where the inner product is 

i 

We can now demand 

(F\l> f> ) = 8 afit (3.113) 

in which case we can write 



We have seen that = (1, 1, . . . , 1) is a left eigenvector with eigenvalue A = 0, since J2i ^ij = 0- We do n °t know 
a priori the corresponding right eigenvector, which depends on other details of F^ . Now let's expand P i (t) in the 
right eigenvectors of r, writing 

^(*)=X)C r a(*)^ ? - ( 3 - 115 ) 



Then 



This allows us to write 



Hence, we can write 



dt 



dt ^ dt Vi 

a 

= - r ii P j = -Y, C « r iiW (3.H6) 

a 

-\ a C a =► a a (t) = C a (0)e- A « t . (3.117) 
P i (t) = E C 'a(0)e- A » t V l a - (3.H8) 
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It is now easy to see that Re(A Q ) > for all A, or else the probabilities will become negative. For suppose 
Re (A a ) < for some a. Then as t — ► oo, the sum in eqn. 3.118 will be dominated by the term for which A Q has 
the largest negative real part; all other contributions will be subleading. But we must have J2 i ipf = since | ip a ) 

must be orthogonal to the left eigenvector (f> a=0 = (1,1,..., 1). Therefore, at least one component of ipf ( z - e - f° r 
some value of i) must have a negative real part, which means a negative probability! 12 As we have already proven 
that an initial nonnegative distribution {P^t = 0)} will remain nonnegative under the evolution of the master 
equation, we conclude that P^t) — ► P° q as t — ► oo, relaxing to the A = right eigenvector, with Re (A a ) > for all 
a. 



3.8 Appendix II : Radioactive Decay 

Consider a group of atoms, some of which are in an excited state which can undergo nuclear decay. Let P n (t) be 
the probability that n atoms are excited at some time t. We then model the decay dynamics by 

if m > n 

W mn = { m if m = n - 1 (3.119) 
if m < n — 1 . 

Here, 7 is the decay rate of an individual atom, which can be determined from quantum mechanics. The master 
equation then tells us 

dP 

-^ = { n + l) 1 P n+1 -n 1 P n . (3.120) 

The interpretation here is as follows: let | n ) denote a state in which n atoms are excited. Then P„ (t) = \{ip(t) | n ) | 2 . 
Then P n (t) will increase due to spontaneous transitions from | n+ 1 ) to | n ), and will decrease due to spontaneous 
transitions from | n ) to | n — 1 ) . 

The average number of particles in the system is 

00 

N(t) = J2^P n (t)- (3.121) 

Note that 

dN 



00 

J2[n(n-l)P n -n 2 P n ] (3.122) 



= 7 

n=0 
00 



n=0 

Thus, 

N(t) = N{0) e" 7 * . (3.123) 
The relaxation time is r = 7 _1 , and the equilibrium distribution is 

= <U ■ (3-124) 



12 Since the probability P. (t) is real, if the eigenvalue with the smallest (i.e. largest negative) real part is complex, there will be a corresponding 
complex conjugate eigenvalue, and summing over all eigenvectors will result in a real value for P i (t). 
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Note that this satisfies detailed balance. 
We can go a bit farther here. Let us define 



P(z,t) = J2z n P n (t). (3.125) 



n=0 



This is sometimes called a generating function. Then 

dP 



dt 

n=0 



7 ^z"[(n + l)P n+1 -nP n 



dP dP 

= 7 

Thus, 



(3.126) 



T a ^ z 'a 

oz oz 



1 dP dP 

-^--(l-z)^-=0. (3.127) 

7 ot oz 

We now see that any function /(£) satisfies the above equation, where £ = 7t — ln(l — z). Thus, we can write 

P(«,t) = /(7t-ln(l-«)). (3.128) 

Setting i = we have P(z, 0) = /( — ln(l — zfj, and inverting this result we obtain f(u) = P(l — e~ u , 0), i.e. 

P(z,t) = P(l + (z- l)e" 7 *, 0) . (3.129) 

The total probability is P(z = l,t) = J2^Lo which clearly is conserved: P(l, t) = P(l, 0). The average particle 
number is 

BP 

= e-T P(l, 0) = N(0) e" 7 ' . (3.130) 

z=l 



N{t) = Y. nP ^) 

n=0 



3.9 Appendix III : Canonical Transformations in Hamiltonian Mechanics 

The Euler-Lagrange equations of motion of classical mechanics are invariant under a redefinition of generalized 
coordinates, 

Q<r = Qa(Ql>---><lr> t ) > (3.131) 

called a point transformation. That is, if we express the new Lagrangian in terms of the new coordinates and their 
time derivatives, viz. 

L(Q, Q, t) = L(q(Q, t) , q(Q, Q, t) , t) , (3.132) 
then the equations of motion remain of the form 



dL d ( dL 



dQa dt\dQ a 

Hamilton's equations 13 , 



(3.133) 



q* = T— , P* = -jr- ( 3 - 134 ) 



13 We revert to using H for the Hamiltonian in this section, rather than H as before. 
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are invariant under a much broader class of transformations which mix all the q's and p's, called canonical trans- 
formations. The general form for a canonical transformation is 



% = la (Ql , ■ ■ ■ , Qr , P l > ■ ■ ■ , P r > *) 
Pa = Pa {Ql J • • • J Qr > P U • • • i P r J *) j 

with (7 G { 1, . . . , r} . We may also write 



with i G {1, . . . , 2r}. Here we have 

6 



g- if 1 < i < r 
Pi-r \in < i <2r 



Qi 



if 1 < i < r 
if r < i < 2r 



(3.135) 
(3.136) 

(3.137) 
(3.138) 



The transformed Hamiltonian is H (Q, P, t). 

What sorts of transformations are allowed? Well, if Hamilton's equations are to remain invariant, then 



which gives 



dH 



P~ = - 



dH 







dQo 



(3.139) 



(3.140) 



I.e. the flow remains incompressible in the new (Q, P) variables. We will also require that phase space volumes 
are preserved by the transformation, i.e. 



(3.141) 



(f)- 


d(Q,P) 




d(q,p) 



det 

This last condition guarantees the invariance of the phase space measure 

r 

dfi = hT r JJ dq a dp a , 

(7=1 

where h in the normalization pref actor is Planck's constant. 



(3.142) 



Chapter 4 



Statistical Ensembles 
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- F. Reif, Fundamentals of Statistical and Thermal Physics (McGraw-Hill, 1987) 

This has been perhaps the most popular undergraduate text since it first appeared in 1967, and with good 
reason. 

- A. H. Carter, Classical and Statistical Thermodynamics 
(Benjamin Cummings, 2000) 

A very relaxed treatment appropriate for undergraduate physics majors. 

- D. V. Schroeder, An Introduction to Thermal Physics (Addison- Wesley, 2000) 

This is the best undergraduate thermodynamics book I've come across, but only 40% of the book treats 
statistical mechanics. 
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Remarkably crisp, though dated, this text is organized as a series of brief discussions of key concepts and 
examples. Published by Dover, so you can't beat the price. 

- M. Kardar, Statistical Physics of Particles (Cambridge, 2007) 

A superb modern text, with many insightful presentations of key concepts. 

- M. Plischke and B. Bergersen, Equilibrium Statistical Physics (3 rd edition, World Scientific, 2006) 

An excellent graduate level text. Less insightful than Kardar but still a good modern treatment of the subject. 
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- E. M. Lifshitz and L. P. Pitaevskii, Statistical Physics (part I, 3 rd edition, Pergamon, 1980) 

This is volume 5 in the famous Landau and Lifshitz Course of Theoretical Physics. Though dated, it still 
contains a wealth of information and physical insight. 
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4.2 Microcanonical Ensemble (/iCE) 
4.2.1 The microcanonical distribution function 

We have seen how in an ergodic dynamical system, time averages can be replaced by phase space averages: 

ergodicity (f(<p)) T = (f(<p)) s , (4.1) 



where 



i 

(f(<p)) T =]im o ±Jdtf(<p(t)). (4.2) 



and 



</(¥>)> s = jdnf{<p)5{E -&{#))/ Jd^5(E-H{ i p)) . 



(4.3) 



Here H(ip) = H(q,p) is the Hamiltonian, and where S(x) is the Dirac 5-f unction 1 . Thus, averages are taken over 
a constant energy hypersurface which is a subset of the entire phase space. 

We've also seen how any phase space distribution q(A 1 ,A k ) which is a function of conserved quantified A a (cp) 
is automatically a stationary (time-independent) solution to Liouville's equation. Note that the microcanonical 
distribution, 

QB (<p)=5(E-H(<p)) j Jd^5{E-H( V )) , (4.4) 

is of this form, since H(<p) is conserved by the dynamics. Linear and angular momentum conservation generally 
are broken by elastic scattering off the walls of the sample. 

So averages in the microcanonical ensemble are computed by evaluating the ratio 

(A) = "*" r A5(E — H) 
w Tr S(E-H) 

where Tr means 'trace', which entails an integration over all phase space: 

Here N is the total number of particles and d is the dimension of physical space in which each particle moves. 
The factor of 1/Nl, which cancels in the ratio between numerator and denominator, is present for indistinguishable 
particles. The normalization factor (2-K?i)~ Nd renders the trace dimensionless. Again, this cancels between numer- 
ator and denominator. These factors may then seem arbitrary in the definition of the trace, but we'll see how they 
in fact are required from quantum mechanical considerations. So we now adopt the following metric for classical 
phase space integration: 

d " = M g ■ (4-7) 



1 We write the Hamiltonian as H (classical or quantum) in order to distinguish it from magnetic field (H) or enthalpy (H). 
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4.2.2 Density of states 

The denominator, 

D(E) = Tr 5(E - H) , (4.8) 
is called the density of states. It has dimensions of inverse energy, such that 

E+AE 

D(E)AE= JdE' Jd^S(E' - H) = J (4.9) 

E E<H<E+AE 

— # of states with energies between E and E + AE . 
Let us now compute D{E) for the nonrelativistic ideal gas. The Hamiltonian is 

N 



=1 



We assume that the gas is enclosed in a region of volume V, and we'll do a purely classical calculation, neglecting 
discreteness of its quantum spectrum. We must compute 



°w=wJuMi £ -tM- (411) 



We'll do this calculation in two ways. First, let's rescale pf = \j2mE uf. We then have 



D W = 1^ 4 ld M u6(ul + ul + ... + ul I -l). (4.12) 



AT! I h I 



E 



Here we have written u = (u 1 , u 2 , . . . , u M ) with M = Nd as a Ai-dimensional vector. We've also used the rule 
5{Ex) = E~ 1 5(x) for ^-functions. We can now write 

d M u = u M - l dudQ M , (4.13) 



where is the i\/-dimensional differential solid angle. We now have our answer: 



.2 



What remains is for us to compute Q M , the total solid angle in M dimensions. We do this by a nifty mathematical 
trick. Consider the integral 

oo 

T M = Jd M u e-" 2 = Q M Jdu u* 1 - 1 e- u2 

(4-15) 

= kn M jdss 1 * M -\-° = \n M T(\M), 



2 The factor of i preceding Q M in eqn. 4.14 appears because <5(« 2 — 1) = i 5(u — 1) + 4 S(u + 1). Since u = \u\ > 0, the second term can 
be dropped. 
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where s = u 2 , and where 

oo 

T(z) = jdtt^e- 1 (4.16) 
o 

is the Gamma function, which satisfies zT(z) = T(z + l). 3 On the other hand, we can compute T M in Cartesian 
coordinates, writing 

M 

i m = I ld Ul e-<\ =(V^) M . (4.17) 



Therefore 

2tt m / 2 

r(A//2) 



n M = TT7Y77^ ■ (4-18) 



We thereby obtain Q 2 = 2tt, f2 3 = Air, Q 4 = 2n 2 , etc., the first two of which are familiar. 
Our final result, then, is 

V N ( m \ Nd/2 E^- 1 

Here we have emphasized that the density of states is a function of E, V , and N. Using Stirling's approximation, 

In TV! = TV In TV - N + \\nN + ± ln(27r) + 0{N- 1 ) , (4.20) 
we may define the statistical entropy, 

S(E,V,N) = k s ]nD(E,V,N)=Nk B J^, ^)+0(\nN), (4.21) 



where 

Recall k B = 1.3806503 x 10~ 16 erg/K is Boltzmann's constant. 

The second way to calculate D(E) is to first compute its Laplace transform, Z{0): 

oo 

Z(J3)=£ [D{E)} = JdE e' pE D(E) = Tr e~ fiA . (4.23) 
o 

The inverse Laplace transform is then 

e+ioo 

D(E) = C- 1 [Z{fij\ = J e? E Z(P) , (4.24) 

c—ioo 

where c is such that the integration contour is to the right of any singularities of Z(j3) in the complex /3-plane. We 



3 Note that for integer argument, T(k) = (k — 1)! 
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had 
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1 
1 

1 . 
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1 ' 

\ 

\ 







Nd even 



-Re/3 



/ r 

branch cut \ 



Im/3 



Nd odd 



■Re/3 



Figure 4.1: Complex integration contours C for inverse Laplace transform C 1 [Z(/3)] = D(E). When the product 
dN is odd, there is a branch cut along the negative Rc (3 axis. 



then have 



_ 1 TT (44 




Nd 



Pp 2 /2m 



The inverse Laplace transform is then 

D(E) 



-( — ) R- Nd ' 2 
N\ \2itK 2 ) 1 



N\ {27rh 2 J J 2m P 
c 

yN / m E hNd-l 

Nl \2TTh 2 ) T(Nd/2) ' 



(4.25) 



(4.26) 



exactly as before. The integration contour for the inverse Laplace transform is extended in an infinite semicircle in 
the left half /3-plane. When Nd is even, the function fi~ Nd / 2 has a simple pole of order Nd/2 at the origin. When 
Nd is odd, there is a branch cut extending along the negative Re (3 axis, and the integration contour must avoid 
the cut, as shown in fig. 4.1. 

For a general system, the Laplace transform, Z(/3) = C[D(E)] also is called the partition function. We shall again 
meet up with Z(/3) when we discuss the ordinary canonical ensemble. 



4.2.3 Arbitrariness in the definition of S(E) 



Note that D{E) has dimensions of inverse energy, so one might ask how we are to take the logarithm of a di- 
mensionful quantity in eqn. 4.21. We must introduce an energy scale, such as AE in eqn. 4.9, and define 
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D(E; AE) = D(E) AE and S(E; AE) = k B \nD{E; AE). The definition of statistical entropy then involves the 
arbitrary parameter AE, however this only affects S(E) in an additive way. That is, 

S(E,V,N;AE 1 )=S(E,V,N;AE 2 )+k B ]n(^j . (4.27) 

Note that the difference between the two definitions of S depends only on the ratio AE 1 / AE 2 , and is independent 
of E, V, and N. 



4.2.4 Ultra-relativistic ideal gas 

Consider an ultrarelativistic ideal gas, with single particle dispersion e(p) = cp. We then have 



. N 

\0 

V N fT(d)Q d ^ N 



N\ \c d h d P d / 

The statistical entropy is S(E, V 7 N) = k B lnD(E, V, N) = Nk B (/>(§, with 



(4.28) 



# ,?-)= d mf ^ + mfn + mf ] + ( d + i) (4.29) 

N'NJ \N J \N J \ (dhc) d 1 



4.2.5 Discrete systems 

For classical systems where the energy levels are discrete, the states of the system | er ) are labeled by a set of 
discrete quantities a 2 , ■ ■ •}, where each variable a { takes discrete values. The number of ways of configuring 
the system at fixed energy E is then 

^.jv)=EW' (430) 

a 

where the sum is over all possible configurations. Here N labels the total number of particles. For example, if 
we have N spin-i particles on a lattice which are placed in a magnetic field H, so the individual particle energy 
is e i = —fj, Ha, where a = ±1, then in a configuration in which particles have a i = +1 and = N — 
particles have a i = —1, the energy is E = (N^ — N^)fi H. The number of configurations at fixed energy E is 

since 7V T/i = f =F 2^H- Tne statistical entropy is S(E, N) = k B In Ct(E, N). 



4.3 The Quantum Mechanical Trace 

Thus far our understanding of ergodicity is rooted in the dynamics of classical mechanics. A Hamiltonian flow 
which is ergodic is one in which time averages can be replaced by phase space averages using the microcanonical 
ensemble. What happens, though, if our system is quantum mechanical, as all systems ultimately are? 
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4.3.1 The density matrix 

First, let us consider that our system S will in general be in contact with a world W. We call the union of S and 
W the universe, U — W U S. Let | TV ) denote a quantum mechanical state of W, and let | n ) denote a quantum 
mechanical state of S. Then the most general wavefunction we can write is of the form 

l*> = E*iV,nl^>®h>. (4.32) 

N,n 

Now let us compute the expectation value of some operator A which acts as the identity within W, meaning 
( TV LA I N' ) = A S NN ,, where A is the 'reduced' operator which acts within S alone. We then have 

(*|i|*>= £ E W (n | A | ri ) 

N,N'n,n' (4.33) 

= Tr(gA) , 

where 

p = EE^«v l n 'X n l ( 4 - 34 ) 

AT n ,n' 

is the density matrix. The time-dependence of £ is easily found: 



iV n,n' 

_ e -iHt/h ~ e +iHt/h 



(435) 



where is the Hamiltonian for the system 5. Thus, we find 

ih^ = [H,g}. (4.36) 

Note that the density matrix evolves according to a slightly different equation than an operator in the Heisenberg 
picture, for which 

8 A 

A(t) = e +tHt/h A e - lHt ' h ifi — = [A, H] = - [H, A] . (4.37) 

For Hamiltonian systems, we found that the phase space distribution g(q,p, t) evolved according to the Liouville 
equation, 

i^ = LQ, (4-38) 

where the Liouvillian L is the differential operator 

L--iY{— — - — — \ (439) 
^ ' dp. dq- dq- dp 1 ' ' 



3 = 1 



Accordingly any distribution g(A 1 , . . .,A k ) which is a function of constants of the motion A a (q,p) is a station- 
ary solution to the Liouville equation: d t g{A 1 , . . . , A k ) = 0. Similarly, any quantum mechanical density matrix 
which commutes with the Hamiltonian is a stationary solution to eqn. 4.36. The corresponding microcanonical 
distribution is 

g E = 5(E-H). (4.40) 
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Figure 4.2: A system S in contact with a 'world' W. The union of the two, universe U = W U S, is said to be the 
'universe'. 

4.3.2 Averaging the DOS 

If our quantum mechanical system is placed in a finite volume, the energy levels will be discrete, rather than 
continuous, and the density of states (DOS) will be of the form 



where {£7} are the eigenvalues of the Hamiltonian H. In the thermodynamic limit, V — ► 00, and the discrete 
spectrum of kinetic energies remains discrete for all finite V but must approach the continuum result. To recover 
the continuum result, we average the DOS over a window of width AE: 



If we take the limit AE — > but with AE ^> SE, where 6E is the spacing between successive quantized levels, we 
recover a smooth function, as shown in fig. 4.3. We will in general drop the bar and refer to this function as D(E). 
Note that 5E ~ 1/D{E) = e ~ N ^ e ^ is (typically) exponentially small in the size of the system, hence if we took 
AE oc V^ 1 which vanishes in the thermodynamic limit, there are still exponentially many energy levels within an 
interval of width AE. 



4.3.3 Coherent states 

The quantum-classical correspondence is elucidated with the use of coherent states. Recall that the one-dimensional 
harmonic oscillator Hamiltonian may be written 




(4.41) 



E+AE 




(4.42) 



E 



rr P 1 1 2 2 

2m z 



(4.43) 




where a and a* are ladder operators satisfying [a, eft] = 1, which can be taken to be 
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D{E) 



D(E) = (D(E)) AE 



D(E) = ^2S(E-E l 




Figure 4.3: Averaging the quantum mechanical discrete density of states yields a continuous curve. 



with I = y/Kj2 muj n . Note that 



q = £ (a + a f ) 



The ground state satisfies a ipo (q) = 0, which yields 
The normalized coherent state \ z } is defined as 

1 1 00 71 

n=0 v n\ 

The overlap of coherent states is given by 

(z 1 \z 2 ) = e~^^ 2 e~^ 2 e s ^ , 



(4.45) 



(4.46) 



(4.47) 



(4.48) 



hence different coherent states are not orthogonal. Despite this nonorthogonality, the coherent states allow a 
simple resolution of the identity 



/d z d 2 z d Rez d Imz 
7T- \ z )( z \ 5 7T- = 
2m l-Kl 7T 

which is straightforward to establish. 

To gain some physical intuition about the coherent states, define 

_ Q UP 

and write \z) = \Q,P). One finds (exercise!) 

i> Q p {q) = {q\z) = (27T1 2 )- 1 / 4 e- lP Q^ 2h e^/ R e-^ 2 / u2 



(4.49) 



(4.50) 



(4.51) 



hence the coherent state ipQ P (q) is a wavepacket Gaussianly localized about q = Q, but oscillating with average 
momentum P. 
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For example, we can compute 



(Q,P\q\Q,P) = (z\£(a + a!)\z) = 2£Rez = Q (4.52) 

h 

2ii 



(Q,P\p\Q,P) = ( z \£-(a- a?)\z) = jhaz = P (4.53) 



as well as 



(Q,P\q 2 \Q,P) = (z\£ 2 (a + (Jf \z) = Q 2 + l 2 (4.54) 
(Q,P\ V 2 \Q,P) = -( Z \^a-Jy\z) = P 2 + ^ . (4.55) 



Thus, the root mean square fluctuations in the coherent state | Q, P ) are 



Aq = £=J- , Ap=- = J— -°, (4.56) 

y 2mw 2i V 2 

and Aq • Ap = ^h. Thus we learn that the coherent state ipg P (q) is localized in phase space, i.e. in both position 
and momentum. If we have a general operator A(q,p), we can then write 

(Q,P\ A(q 7 p) \Q,P) = A(Q, P) + 0{h) , (4.57) 

where A(Q, P) is formed from A(q,p) by replacing q — > Q and p — > P. 
Since 

d 2 z _ d Rez d Imz dQ dP ^ 
we can write the trace using coherent states as 

oo oo 

TrA=^fdQfdP(Q,P\A\Q,P). (4.59) 

— oo — oo 

We now can understand the origin of the factor 2irh in the denominator of each (q. , p i ) integral over classical phase 
space in eqn. 4.6. 

Note that u Q is arbitrary in our discussion. By increasing lu , the states become more localized in q and more plane 
wave like in p. However, so long as ui is finite, the width of the coherent state in each direction is proportional to 
h 1 / 2 , and thus vanishes in the classical limit. 



4.4 Thermal Equilibrium 

Consider two systems in thermal contact, as depicted in fig. 4.4. The two subsystems #1 and #2 are free to exchange 
energy, but their respective volumes and particle numbers remain fixed. We assume the contact is made over a 
surface, and that the energy associated with that surface is negligible when compared with the bulk energies E 1 
and E 2 . Let the total energy be E = E 1 + E 2 . Then the density of states D(E) for the combined system is 



D{E)= UEMEJD^E-EJ 



(4.60) 
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Figure 4.4: Two systems in thermal contact. 

The probability density for system #1 to have energy E 1 is then 

D^D^E-E,) 
P ^ ) = D{E) ■ (461) 

Note that P 1 (E^ is normalized: J dE 1 P 1 (£' 1 ) = 1. We now ask: what is the most probable value of E x 2 We find 
out by differentiating P 1 (E 1 ) with respect to E 1 and setting the result to zero. This requires 

«- 1 dP ^- 9 lnP liEl ) 



P f^ ^ (4.62) 
= w -lnD l{ E l) + —lnD 2{ E-E 1 ). 

Thus, we conclude that the maximally likely partition of energy between systems #1 and #2 is realized when 

dt = e£ ■ (4 - 63) 

This guarantees that 

S(E,E 1 ) = S 1 (E 1 ) + S 2 (E-E 1 ) (4.64) 
is a maximum with respect to the energy E lr at fixed total energy E. 

The temperature T is defined as 

a result familiar from thermodynamics. The difference is now we have a more rigorous definition of the entropy. 
When the total entropy S is maximized, we have that T l = T 2 . Once again, two systems in thermal contact and 
can exchange energy will in equilibrium have equal temperatures. 

According to eqns. 4.22 and 4.29, the entropies of nonrelativistic and ultrarelativistic ideal gases in d space dimen- 
sions are given by 

S m = \Nd fc B In ( ^ ) + Nk B In [ ^ ) + const. (4.66) 



S UR = Ndk B In ( — ) + Nk B In ( — ) + const. . (4.67) 



NJ B \N 

'-) + Nk„ In ( — 

s N ] \N 

Invoking eqn. 4.65, we then have 

E NR = ±Ndk B T , E l]R = Ndk B T. (4.68) 
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We saw that the probability distribution P 1 (E 1 ) is maximized when T l = T 2 , but how sharp is the peak in the 
distribution? Let us write E 1 = E{ + AE lr where E* is the solution to eqn. 4.62. We then have 



\nP 1 (E* 1 +AE 1 ) = \nP 1 (E* 1 ) + ^- d 



2k B dEf 



1 d 2 S, 



where E 2 = E — E\. We must now evaluate 



d 2 S 



dE 2 dE \T 



T 2 \8E / 



V,N 



2K dE 2 



T 2 a, 



(AEj 



where C v = (8E/dT) v N is the heat capacity. Thus, 



P 



where 



C 



p* e -(AE 1 ) 2 /2k B T 2 C v 



a, 



The distribution is therefore a Gaussian, and the fluctuations in AE 1 can now be computed: 



<(A£i) 2 ) 



k„T 2 C\, 



(AE^ - k u TJC v /k 



(4.69) 

(4.70) 

(4.71) 
(4.72) 

(4.73) 



The individual heat capacities C V1 and C V 2 scale with the volumes V 1 and V 2 , respectively. If V 2 3> V lr then 
C V2 3> C v x , in which case C v ps C V1 . Therefore the RMS fluctuations in AE 1 are proportional to the square 
root of the system size, whereas E 1 itself is extensive. Thus, the ratio (A£' 1 ) RMS /£ , 1 oc V^ 1 ! 2 scales as the inverse 
square root of the volume. The distribution P 1 (E 1 ) is thus extremely sharp. 



4.5 Ordinary Canonical Ensemble (OCE) 
4.5.1 Canonical distribution and partition function 

Consider a system S in contact with a world W , and let their union U = W U S be called the 'universe'. The 
situation is depicted in fig. 4.2. The volume V s and particle number N s of the system are held fixed, but the 
energy is allowed to fluctuate by exchange with the world W. We are interested in the limit iV s — > oo, A^ w — > oo, 
with iV s <C N w , with similar relations holding for the respective volumes and energies. We now ask what is the 
probability that S is in a state | n ) with energy E n . This is given by the ratio 



P„ = lim 



D w (E v -E n )AE 



D V (E V )AE 

# of states accessible to W given that E s 
total # of states in U 



E„ 



(4.74) 



Then 



lnP„ = In D W (E V - E n ) - \nD v (E v ) 
= ]nD w (E v ) -]nD v (E v ) - E n 
= -a- l3E n . 



9 In D W (E) 



dE 



(4.75) 



E=E, 
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The constant (3 is given by 

9 In D W (E) 



dE 



k T 



(4.76) 



Thus, we find P n = e a e & B n. The constant a is fixed by the requirement that J2 n P n = 1 ; 

P n = i e -^ , Z(T,V,N)=Y / e-P E n =Tre-^. (4.77) 

n 

We've already met in eqn. 4.23 - it is the Laplace transform of the density of states. It is also called the 
partition function of the system S. Quantum mechanically we can write the ordinary canonical density matrix as 



Tr e-P" 



(4.78) 



Note that [g, H~\ = 0, hence the ordinary canonical distribution is a stationary solution to the evolution equation 
for the density matrix. Note that the OCE is specified by three parameters: T, V, and N. 

4.5.2 The difference between P(E n ) and P n 

Let the total energy of the Universe be fixed at E v . The joint probability density P(E S , E w ) for the system to have 
energy E s and the world to have energy E w is 

P(E S , E w ) - D S (E S ) D W (E W ) 5{E V - E s - E m )/D V (E V ) , (4.79) 

where 

oo 

D V (E V )= JdE s D S (E S ) D W (E V — E s ) , (4.80) 

— oo 

which ensures that J dE s J dE w P(E S ,E W ) = 1. The probability density P(E S ) is defined such that P(E S ) dE s is 
the (differential) probability for the system to have an energy in the range [E s ,E g + dE s ]. The units of P(E S ) are 
To obtain P(E S ), we simply integrate the joint probability density P(E S ,E W ) over all possible values of E w , 
obtaining 

as we have in eqn. 4.74. 

Now suppose we wish to know the probability P n that the system is in a particular state | n) with energy E n . 
Clearly 

= probability that E s € [E n , E n + AE] _ PjEj AE _ D W {E V - E n ) 

" ae~*o # of S states with E s € [E n , E n + AE] D s {E n ) AE D V (E V ) ' ( ' ' 

4.5.3 Averages within the OCE 

To compute averages within the OCE, 



{A)=Tr (g A) = , (4.83) 
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where we have conveniently taken the trace in a basis of energy eigenstates. In the classical limit, we have 

Q(<p) = ^e- f>6( * ) , Z = Tre-^ = [dne-P 6 ^ , (4.84) 
with dfj,= jfi Y\f=i{d d qj d d pj/h d ) for identical particles ('Maxwell-Boltzmann statistics')- Thus, 

{ A) = Me A) = ^m^. (4.85) 

4.5.4 Entropy and free energy 

The Boltzmann entropy is defined by 

S = -k B Tr (g In g) = -k B ]T Pn In P n ■ (4-86) 

n 

The Boltzmann entropy and the statistical entropy S = k B In D{E) are identical in the thermodynamic limit. 
We define the Helmholtz free energy F(T, V, N) as 

F(T, V, N) = -k B T In Z(T, V, N) , (4.87) 



hence 

Therefore the entropy is 



P n = eP F e~P E n , lnP n = (3F-j3E n . (4.88) 



S = -k B Y,P n {PF-(3E n ) 

n 

F (H) 



(4.89) 



T T 



which is to say 
where 

is the average energy. We also see that 



F = E-TS , (4.90) 

E = yP n E n = TrAe ~ 0H (4.91) 
Tr e~P H 



Z = Jre-^ = ^e-^ =► E = = ~ ^Z = . (4.92) 

Thus, F(T, V, N) is a Legendre transform of E(S, V, N), with 

dF = -SdT -pdV + iidN , (4.93) 

which means 
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4.5.5 Fluctuations in the OCE 



In the OCE, the energy is not fixed. It therefore fluctuates about its average value E = (H) . Note that 

BE 2 9.E _ <9 2 lnZ 

Tr ff 2 / Tr Her^_ V (4.95) 

~ Tr e"^ ~ V Tre-^ / 

Thus, the heat capacity is related to the fluctuations in the energy, just as we saw at the end of §4.4: 

«*>-<*>■) (4%) 

For the nonrelativistic ideal gas, we found C v = | Nk B , hence the ratio of RMS fluctuations in the energy to the 
energy itself is 



((A^) 2 ) ^k^Cy _ [Y 



(4.97) 



(H) %Nk B T V Nd 

and the ratio of the RMS fluctuations to the mean value vanishes in the thermodynamic limit. 

The full distribution function for the energy is 

P(£) = (5(8 H)) = Tr S ^~^f PH = \ D(£) e-e £ . (4.98) 

Thus, 

-P[£-TS{£)] 

P(£) = — , (4.99) 

Jd£e-^ £ - TS ^ 

where S(£) = k B \nD(£) is the statistical entropy. Let's write £ = E + S£, where E extremizes the combination 
£ — T S(£), i.e. the solution to T S'(E) = 1, where the energy derivative of S is performed at fixed volume V and 
particle number N. We now expand S(E + S£) to second order in S£ , obtaining 

S£ (S£) 2 

S{E + 8£) = S{E) + Y -^L-+... (4.100) 
Recall that S"(E) = (£) = -j^-. Thus, 

£-TS{£)=E-TS{E) + ^-+0((5£f) . (4.101) 

Applying this to both numerator and denominator of eqn. 4.99, we obtain 4 

[5£f 



P{£) = J\f exp 



2k B T 2 C v 



(4.102) 



where N = (2-Kk B T 2 C v ) 1 ' 2 is a normalization constant which guarantees Jd£ P{£) = 1. Once again, we see that 
the distribution is a Gaussian centered at {£) = E, and of width (A£) RMS = yJk B T 2 C v . This is a consequence of 
the Central Limit Theorem. 

4 In applying eqn. 4.101 to the denominator of eqn. 4.99, we shift £ by E and integrate over the difference S£ = £ — E, retaining terms up 
to quadratic order in 6£ in the argument of the exponent. 
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{K,Pn} 



{E n) P n } 



aw 



E P n dE r 



n 



n 



Figure 4.5: Microscopic, statistical interpretation of the First Law of Thermodynamics. 



and therefore 



4.5.6 Thermodynamics revisited 

The average energy within the OCE is 

n 

n n 

= 3Q-dW , 

n 

dQ = J^E n dP n . 

n 

Finally from P n = Z~ x e~ E n' k B T , we can write 

E n = -KTlnZ - k B T In P n , 



where 



(4.103) 



(4.104) 



(4.105) 
(4.106) 



(4.107) 
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with which we obtain 



dQ = J2E n dP n 

n 

= -k B T In Z ]T dP n -k B T^2 In P n dP n 

n n 

= Td(-k B ^Pn^P n )=TdS. 



(4.108) 



Note also that 



dW = -J2P n dE n 

n 

= ^zZ P n(z2%t dXl ) (4-109) 

n \ i 1 / 

= -Y^P n (n\ — \n)dX i = Y,F i dX i , 

n,i 1 i 

so the generalized force F i conjugate to the generalized displacement dX i is 

*~s*!|~<||>. 

This is the force acting on the system 5 . In the chapter on thermodynamics, we defined the generalized force 
conjugate to X i asy i = —F i . 

Thus we see from eqn. 4.104 that there are two ways that the average energy can change; these are depicted in 
the sketch of fig. 4.5. Starting from a set of energy levels {E n } and probabilities {P n }, we can shift the energies 
to {E' n }. The resulting change in energy (A£') I = — W is identified with the work done on the system. We could 
also modify the probabilities to {P,' } without changing the energies. The energy change in this case is the heat 
absorbed by the system: (A£') n = Q. This provides us with a statistical and microscopic interpretation of the First 
Law of Thermodynamics. 

4.5.7 Generalized susceptibilities 

Suppose our Hamiltonian is of the form 

H = H{\) = H - XQ , (4.111) 
where A is an intensive parameter, such as magnetic field. Then 

Z{\) = Tr er^H -\Q) (4.112) 



and 



But then from Z = e 13 F we have 



~ =/^Tr(Qe-^W)=/3<Q>. (4.113) 
Q(A,T) = (Q) = -(j^ . (4.114) 



5 tn deriving eqn. 4.110, we have used the so-called Feynman-Hellman theorem of quantum mechanics: d{n\H\n) = (n\ dH \n), if \n) is an 
energy eigenstate. 
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Typically we will take Q to be an extensive quantity. We can now define the susceptibility X as 
The volume factor in the denominator ensures that X is intensive. 

It is important to realize that we have assumed here that \H , Q] =0, i.e. the 'bare' Hamiltonian H and the 
operator Q commute. If they do not commute, then the response functions must be computed within a proper 
quantum mechanical formalism, which we shall not discuss here. 

Note also that we can imagine an entire family of observables {Q k } satisfying [Q k , Q k ,] = and [H Q , Q k ~\ = 0, 
for all k and k'. Then for the Hamiltonian 

H(\)=H Q -Y / hQk, (4.H6) 

k 

we have that 

Q k (X,T) = (Q k ) = - f^p\ (4.117) 



and we may define an entire matrix of susceptibilities, 

1 dQ k 1 d 2 F 



^kl 



V dX l V d\ k d\ 



(4.118) 



4.6 Grand Canonical Ensemble (GCE) 

4.6.1 Grand canonical distribution and partition function 

Consider once again the situation depicted in fig. 4.2, where a system S is in contact with a world W, their union 
U = W U S being called the 'universe'. We assume that the system's volume V s is fixed, but otherwise it is allowed 
to exchange energy and particle number with W . Hence, the system's energy E s and particle number N s will 
fluctuate. We ask what is the probability that S is in a state | n ) with energy E n and particle number N n . This is 
given by the ratio 



P., 



lim 



D w (E v -E n ,N v -N n )AE 



D V (E V ,N U )AE 

# of states accessible to W given that E s = E n and N s = N n 
total # of states in U 



(4.119) 



Then 



lnP n = lnD w (E v - E n , N v - N n ) - ]nD u (E u ,N v ) 
= lnD w (E v ,N v )-\nD v (E v ,N v ) 



E, 



d\nD w (E,N) 



dE 

-a - f3E n + f3ptN n . 



-N r 



8\nD w {E,N) 



E=E V 
iV = Af T 



ON 



(4.120) 
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The constants (3 and p are given by 

1 



din D W (E,N) 



p = -fc B T 



as 

din D W (E,N) 



(4.121) 



97V 



E = E U 
N = N T - 



(4.122) 



The quantity p has dimensions of energy and is called the chemical potential. Nota bene: Some texts define the 
'grand canonical Hamiltonian' K as 

K = H-nN . (4.123) 
Thus, P n = er a e~P^ E ™~^ N ^ . Once again, the constant a is fixed by the requirement that J2 n P n = 1: 

P„ = I e-"^--^-) , SOS, V, m) = £ e"^^--^-) - Tr e ~^-^ = Tr e~ p& . (4.124) 

~ n 

Thus, the quantum mechanical grand canonical density matrix is given by 

e -pk 

Q=- M - (4.125) 

Tr e-P K 

Note that [q, k] = 0. 

The quantity 3(T, V, /x) is called the grand partition function. It stands in relation to a corresponding free energy in 
the usual way: 

3(T, V, m) = e~ pf2{T ' v -^ = -fc B T InS , (4.126) 

where fi(T, V, p) is the grand potential, also known as the Landau free energy. The dimensionless quantity z = 
is called the fugacity. 

If [H, N] = 0, the grand potential may be expressed as a sum over contributions from each N sector, viz. 

S(T,V,/x) = ^e^ N Z(T,V,N) . (4.127) 

N 

When there is more than one species, we have several chemical potentials {p a }, and accordingly we define 

£ = (4-128) 

a 

with 5 = Tr e~ l3K as before. 

4.6.2 Entropy and Gibbs-Duhem relation 

In the GCE, the Boltzmann entropy is 

n 

= -K J2 P n fa ~ P E n + /».) (4.X29) 

n 

g (H) P(N) 
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which says 

Q = E -TS - fiN , (4.130) 

where 

E = J2E n P n = Tr {gH) (4.131) 

n 

N = J2 N nPn = Tr (qN) . (4.132) 

n 

Therefore, f2{T, V, fi) is a double Legendre transform of E(S, V, N), with 

dfl = -SdT -pdV - Ndfi , (4.133) 

which entails 

fdn\ fdQ\ fd!i\ 

Since f2(T, V, is an extensive quantity, we must be able to write f2 = Vcu(T, fj,). We identify the function w(T, fj,) 
as the negative of the pressure: 

dV E \dVJ T „ dV 

T - M 71 (4.135) 

dE\ 

= — 7 



dVJ, 



Therefore, 

Q = -pV , p = p{T, n) (equation of state) . (4.136) 

This is consistent with the result from thermodynamics that G = E — TS + pV = [iN. Taking the differential, we 
obtain the Gibbs-Duhem relation, 

dH = -SdT - pdV - Ndfi= -pdV -Vdp => S dT - Vdp + N dfi = . (4.137) 
4.6.3 Generalized susceptibilities in the GCE 

We can appropriate the results from §4.5.7 and apply them, mutatis mutandis, to the GCE. Suppose we have a 
family of observables {Q k } satisfying [Q k , Q k ,] = and [H , Q k ] = and [N a , Q k ] = for all k, k' , and a. 
Then for the grand canonical Hamiltonian 

a k 

we have that 

Q k (\,T) = (Q k ) = - f^pj (4.139) 



and we may define the matrix of generalized susceptibilities, 

1 dQ k 1 d 2 Q 
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4.6.4 Fluctuations in the GCE 



Both energy and particle number fluctuate in the GCE. Let us compute the fluctuations in particle number. We 
have 

, . , TriVe -/3(H-MJV) x g 
N=(N) = = ^— = - — InS . 



Therefore, 



1 dN _ Tr N 2 e-/S(*-/**) / Tr N e -ft&-*&) ' 
/3 5/7 ~ Tr e-PW-v-fi) \ Tr e-^-^N) 



(4.141) 



(4.142) 



Note now that 



(iV) 2 " {dfx ) TV V 
where k t is the isothermal compressibility. Note: 

_ d(N,T,V) 



"T i 



dfx ) TV 



d(fi,T, V) 



(4.143) 



Thus, 



which again scales as V 1 / 2 . 



4.6.5 Gibbs ensemble 



d(N,T, V) d(N,T,p) d(V,T,p) d(N,T,n) 
d{N,T,p) ' d(y,T,p)' d(N, T, fi) 'd<y,T,ii) 

__N 2 fdV\ N 2 



(AN), 



N 



V 



(4.144) 



(4.145) 



Let the system's particle number N be fixed, but let it exchange energy and volume with the world W. Mutatis 
mutandis, we have 

D w (E v -E n ,V v -V n )AEAV 



P„ = lim lim 

AE^O AV^O 



D V (E V ,V V ) AE AV 



(4.146) 



Then 



lnP„ = lnD w (E v -E n ,V v - V n ) - \nD v (E v , V v ) 
= lnD w (E v ,V u ) - ]nD v (E v ,V v ) 



E, 



9 In D W (E,V) 



dE 

-a- pE n -/3pV n 



- V„ 



d]nD w (E,V) 



E=E V 
V = V„ 



dV 



(4.147) 
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The constants j3 and p are given by 



din D W (E,V) 



V = V TT 



P = k B T 



dE 

9 In D W (E,V) 



KT 



dV 



E = E V 
V = V„ 



(4.148) 
(4.149) 



The corresponding partition function is 



Y(T,p,N) =Tr er^+P^ 



Vn 



dVe -PpV Z{T,V,N) = e 



= -0G(T,p,N) 



(4.150) 



where V is a constant which has dimensions of volume. The factor V^, -1 in front of the integral renders Y di- 
mensionless. Note that G(Vq) = G(V ) + fc B Tln(V r '/VJ ) ) / so the difference is not extensive and can be neglected 
in the thermodynamic limit. In other words, it doesn't matter what constant we choose for V since it contributes 
subextensively to G. Moreover, in computing averages, the constant V divides out in the ratio of numerator 
and denominator. Like the Helmholtz free energy, the Gibbs free energy G(T,p,N) is also a double Legendre 
transform of the energy E(S, V, N), viz. 



G = E-TS + pV 
dG = -SdT + Vdp + fidN , 



which entails 



S = 



dG 
dT 



V 



p,N 



dG\ 
dp L N 



dG \ 



(4.151) 



(4.152) 



4.7 Statistical Ensembles from Maximum Entropy 



The basic principle: maximize the entropy, 



(4.153) 



4.7.1 fxCE 

We maximize S subject to the single constraint 

C = E P «- 1 = - 

n 

We implement the constraint C = with a Lagrange multiplier, A = fc B A, writing 

S — — S /c b A C . 

and freely extremizing over the distribution {P n } and the Lagrange multiplier A. Thus, 

5S* =6S-KXSC-k n C6X 



5P n -k B C5\ = 



(4.154) 



(4.155) 



(4.156) 
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We conclude that C = and that 

lnP„ = -(l + A), (4.157) 
and we fix A by the normalization condition Y^„ P n = !• This gives 

P n = i , n = £e(£ + A£-£ n )e(£„-.E). (4.158) 

n 

Note that fl is the number of states with energies between E and E + AE. 
4.7.2 OCE 

We maximize S subject to the two constraints 

C 1=E P «- 1 = ' C2=Y, E n P n- E = - ( 4 - 159 ) 

n n 

We now have two Lagrange multipliers. We write 

2 

S* = S-k B Y^ X j C j, ( 4 - 16 °) 



(4.161) 



and we freely extremize over {P n } and {Cj}. We therefore have 

2 

55* = SS - k B £ (Aj + A 2 P„) 5P„ - k B £ C j 6X J 

n j=l 

2 

= -fc B Y [ ln p n + 1 + Ai + A 2 £„] SP„ - fc B ^ C,. SX j = . 

n j=l 

Thus, C x — C 2 = and 

lnP„ = -(l + A 1+ A 2 P„) . (4.162) 
We define A 2 = j3 and we fix X x by normalization. This yields 

P„=ie-^ , 2T = ^e-^. (4.163) 

n 

4.7.3 GCE 

We maximize S subject to the three constraints 

C i = E P n- 1 = > C 2 = Y E n P n-E = , C 3 = ^7V„P„-7V = 0. (4.164) 

n n n 

We now have three Lagrange multipliers. We write 

3 

S* = S - k B Y Aj C 3 , (4.165) 
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and hence 

3 

5S* = <5S - fc B ]T ( A i + A 2 E n + A 3 N n ) SP n -k B J2 °j 6X i 

n j=l 



(4.166) 



= -k B [lnP„ + 1 + A x + A 2 E n + A 3 N n ] SP n - fc B £ ^ ^ = . 

n j=l 

Thus, = C 2 = C 3 = and 

In P n = - (1 + Aj + A 2 + A 3 iV n ) . (4.167) 
We define A 2 = (3 and A 3 = — and we fix Aj by normalization. This yields 

P n = I e -/9(B„-^J ; 3 = ^6-"^-^. (4.168) 

n 

4.8 Ideal Gas Statistical Mechanics 

The ordinary canonical partition function for the ideal gas was computed in eqn. 4.25. We found 



i N r 

Z(T,V,N) = -J[J 



1 t r f d% d% «„ ? 



l J- J (2irhY 



I3p~/2m 



Yl I [ e -PP 2 /2m \ (4.169) 



Nl \ I 2wh e 



1 fv^ N 



N\ \X d 

where A T is the thermal wavelength: 

\ T = ^/2Trh 2 /mk B T . (4.170) 

The physical interpretation of A T is that it is the de Broglie wavelength for a particle of mass m which has a kinetic 
energy of k B T. 

In the GCE, we have 

oo 

E(T,V,ri = J2e f3flN Z(T,V,N) 



N=0 



1 / Ve !1 l k ' BT \ I Ve^/ k v T 



(4.171) 



N=l 

From 3 = er n l k n T : we have the grand potential is 

f2(T, V, n) = -Vk B T e^ /k * T /\ d T . (4.172) 

Since Q = —pV (see §4.6.2), we have 

p(T, = k B T \ T d e^/ fc B T . (4.173) 

The number density can also be calculated: 



V V\dn), 



X-o-eW* 1 . (4.174) 

T,V 



Combined, the last two equations recapitulate the ideal gas law, pV = Nk B T. 
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4.8.1 Maxwell velocity distribution 

The distribution function for momenta is given by 

1 N 

9W = (jvE*rf))- (4-175) 

i=l 

Note that g(p) = (S(p i — p)) is the same for every particle, independent of its label i. We compute the average 

(A) = Tr (Ae^ 1311 ) / Tr e~@ H . Setting i = 1, all the integrals other than that over p l divide out between numerator 
and denominator. We then have 

_ Jgfri S( Pl - p) e-gP?/ 2 " 
' 9(P) ~ Jd^ e-PPV 2m (4.176) 
= (2Trmk B Ty 3 / 2 e~l 3 P 2 / 2m . 
Textbooks commonly refer to the velocity distribution f(v), which is related to g(p) by 

f(v) d 3 v = g(p) d 3 p . (4.177) 

Hence, 

\ 3/2 

m > -mv 2 /2k B T 



/(«) = [^jTy) W • (4-178) 

This is known as the Maxwell velocity distribution. Note that the distributions are normalized, viz. 

Cd 3 P g(p)= [d 3 vf(v) = l. (4.179) 



If we are only interested in averaging functions of v = \v\ which are isotropic, then we can define the Maxwell 
speed distribution, f(v), as 

3/2 



Note that f(v) is normalized according to 

1dvf(v) = l. (4.181) 



It is convenient to represent v in units of v = y/k^T/m, in which case 

/(«) = -¥>(«/«„) , f{s) = \[l s 2 e- s2 ' 2 . (4.182) 



The distribution <p(s) is shown in fig. 4.6. Computing averages, we have 

oo 

C k = (s k ) = Jdss k ^8) = 2 k ' 2 ■ A p (I + I) . (4.183) 
o 

Thus, C = 1, C\ = yf^, C 2 = 3, etc. The speed averages are 

(v k )=C k (^) k/2 . (4.184) 



Note that the average velocity is (v) = 0, but the average speed is (v) = y/8k B T/Trm. The speed distribution is 
plotted in fig. 4.6. 
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Fi gure 4.6: Maxwell distribution of speeds tp(v/vo). The most probable speed is v MAX 
is v AVG = J | v . The RMS speed = V3v . 



= \/2 vq . The average speed 



4.8.2 Equipartition 



The Hamiltonian for ballistic (i.e. massive nonrelativistic) particles is quadratic in the individual components of 
each momentum p t . There are other cases in which a classical degree of freedom appears quadra tically in H as 
well. For example, an individual normal mode £ of a system of coupled oscillators has the Lagrangian 



L — 2? — 2 W S ' 



(4.185) 



where the dimensions of £ are [£] = M 1 / 2 L by convention. The Hamiltonian for this normal mode is then 



H - y + 2 w o £ 



(4.186) 



from which we see that both the kinetic as well as potential energy terms enter quadratically into the Hamiltonian. 
The classical rotational kinetic energy is also quadratic in the angular momentum components. 

Let us compute the contribution of a single quadratic degree of freedom in H to the partition function. We'll 
call this degree of freedom ( - it may be a position or momentum or angular momentum - and we'll write its 
contribution to H as 

H c = \KC, 2 , (4.187) 
where K is some constant. Integrating over ( yields the following factor in the partition function: 



d( e 



-/3KC /2 



2tt \ 



1/2 



(4.188) 



The contribution to the Helmholtz free energy is then 



AF r = i)fe n Tln ,' 

2 b ^ 2ttKT 



(4.189) 
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and therefore the contribution to the internal energy E is 

We have thus derived what is commonly called the equipartition theorem of classical statistical mechanics: 

To each degree of freedom which enters the Hamiltonian quadratically is associated a contribution 
\k B T to the internal energy of the system. This results in a concomitant contribution of |fc B to the heat 
capacity. 

We now see why the internal energy of a classical ideal gas with / degrees of freedom per molecule is E = 
^fNk B T, and C v = ^Nk B . This result also has applications in the theory of solids. The atoms in a solid possess 
kinetic energy due to their motion, and potential energy due to the spring-like interatomic potentials which tend 
to keep the atoms in their preferred crystalline positions. Thus, for a three-dimensional crystal, there are six 
quadratic degrees of freedom (three positions and three momenta) per atom, and the classical energy should 
be E = 3Nk B T, and the heat capacity C v = 3Nk B . As we shall see, quantum mechanics modifies this result 
considerably at temperatures below the highest normal mode (i.e. phonon) frequency, but the high temperature 
limit is given by the classical value C v = 3vR (where v = N/N A is the number of moles) derived here, known as 
the Dulong-Petit limit. 

4.8.3 Quantum statistics and the Maxwell-Boltzmann limit 

Consider a system composed of N noninteracting particles. The Hamiltonian is 

N 

H = Y,h- ( 4 - 191 ) 

The single particle Hamiltonian h has eigenstates | a) with corresponding energy eigenvalues e Q . What is the 
partition function? Is it 

H= ■■■ J2 e ~^ £ " 1+£a2+ '" + ea ^ =C JV > (4-192) 
where ( is the single particle partition function, 

For systems where the individual particles are distinguishable, such as spins on a lattice which have fixed positions, 
this is indeed correct. But for particles free to move in a gas, this equation is wrong. The reason is that for 
indistinguishable particles the many particle quantum mechanical states are specified by a collection of occupation 
numbers n a , which tell us how many particles are in the single-particle state | a ). The energy is 

E = Y,n a e a (4.194) 

a 

and the total number of particles is 



N = J2n a . (4.195) 
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That is, each collection of occupation numbers {?i Q } labels a unique many particle state | {n a } ). In the product 
£ , the collection {n a } occurs many times. We have therefore overcounted the contribution to Z N due to this state. 
By what factor have we overcounted? It is easy to see that the overcounting factor is 

N\ 

degree of overcounting = 



Il a nJ 



which is the number of ways we can rearrange the labels otj to arrive at the same collection {n a }. This follows 
from the multinomial theorem, 



K \ N 



E*«] = E E • • • E n , n T ■ ■ n ! X " & ■ ■ ■ X « + + ■ (4-196) 

r\— 1 / n . n. n 12 K 



Thus, the correct expression for Z N is 



7 - V P -/ 3 E Q "oe Q x 



EE-E 



n a "j 

Nl 



(4.197) 



When we study quantum statistics, we shall learn how to handle these constrained sums. For now it suffices to 
note that in the high temperature limit, almost all the n a are either or 1, hence 



Nl 



Z N ~^- (4.198) 



This is the classical Maxwell-Boltzmann limit of quantum statistical mechanics. We now see the origin of the 1/N\ 
term which is so important in the thermodynamics of entropy of mixing. 



4.9 Selected Examples 

4.9.1 Spins in an external magnetic field 

Consider a system of N s spins , each of which can be either up (a = +1) or down (a = —1). The Hamiltonian for 
this system is 

N, 

H=- f i Hj^a j , (4.199) 

3=1 

where now we write H for the Hamiltonian, to distinguish it from the external magnetic field H, and fj, is the 
magnetic moment per particle. We treat this system within the ordinary canonical ensemble. The partition func- 
tion is 

Z = E-"E e ^-C Ar S (4.200) 



where C is the single particle partition function: 



<7=±1 



C = e^/k B T = 2cQsh [ t^pj (4 201) 
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The Helmholtz free energy is then 

F{T,H,N 5 ) = -k B T\nZ = -N 5 k B T\n 

The magnetization is 



11 T§) 



M 



= N s n tanh ' ' 



dH / 



T, N 



KT 



The energy is 



£ = |(/3F)=-Wtanh(^). 



(4.202) 



(4.203) 



(4.204) 



Hence, E = —HM, which we already knew, from the form of H itself. 

Each spin here is independent. The probability that a given spin has polarization a is 

e 0n o Ha- 



P„ = 



(4.205) 



The total probability is unity, and the average polarization is a weighted average of a = +1 and a = — 1 contribu- 
tions: 



-Pf + -Pj = 1 



= P T -P i= tanh( k 



(4.206) 



At low temperatures T -C ^i H/k B , we have P» ps 1 — e 2 ^o H / k B T _ At high temperatures T > fx H/k B , the two 
polarizations are equally likely and P CT m | ( 1 



fe«T 



The isothermal magnetic susceptibility is defined as 

X'T 1 = 



N s \ dH ) T k B T 



KT 



(4.207) 



(Typically this is computed per unit volume rather than per particle.) At H = 0, we have X T = ^/k B T, which is 
known as the Curie law. 



Aside 



The energy E = —HM here is not the same quantity we discussed in our study of thermodynamics. In fact, 
the thermodynamic energy for this problem vanishes! Here is why. To avoid confusion, we'll need to invoke a 
new symbol for the thermodynamic energy, £ . Recall that the thermodynamic energy £ is a function of exten- 
sive quantities, meaning £ = £(S, M, N s ). It is obtained from the free energy F(T, H, N 5 ) by a double Legendre 
transform: 

£(S, M, N s ) = F(T, H, AQ +TS + HM . (4.208) 



Now from eqn. 4.202 we derive the entropy 



S 



dF 
df 



K k~ In 



2 cosh 



k B T) 



T 



KT 



(4.209) 



Thus, using eqns. 4.202 and 4.203, we obtain £(S, M, N s ) = 0. 
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The potential confusion here arises from our use of the expression F(T, H,N S ). In thermodynamics, it is the Gibbs 
free energy G(T,p, N) which is a double Legendre transform of the energy: G = £ — TS + pV. By analogy, with 
magnetic systems we should perhaps write G = £ — TS — HM, but in keeping with many textbooks we shall 
use the symbol F and refer to it as the Helmholtz free energy The quantity we've called E in eqn. 4.204 is in fact 
E = £ — HM, which means £ = 0. The energy £(S, M, N s ) vanishes here because the spins are noninteracting. 



4.9.2 Negative temperature (!) 



Consider again a system of N 5 spins, each of which can be either up (+) or down (— ). Let N a be the number of 
sites with spin a, where a = ±1. Clearly N + + N_ = N s . We now treat this system within the microcanonical 
ensemble. 



The energy of the system is 



-HM 



(4.210) 



where H is an external magnetic field, and M = (N + — N_)/j, is the total magnetization. We now compute S(E) 
using the ordinary canonical ensemble. The number of ways of arranging the system with N + up spins is 



N, 



(4.211) 



hence the entropy is 



S = k„ inn 



-N 5 k B ^x\nx + (1 - x)\n(l - a;) | 



(4.212) 



in the thermodynamic limit: iV 5 — > oo, N + — > oo, x = N + /N 5 constant. Now the magnetization is M = (N^ 
N_)[i = (2N + — N s )fx , hence if we define the maximum energy E = N 5 fJ, H, then 



E 

eZ 



M 



/'o 



1 - 2a; 



x 



Ep-E 
2E n 



(4.213) 



We therefore have 



S(E,N s ) = -N s k B 



2E n 



E\JE Q -E 



2E n 



2E n 



E 



2En 



(4.214) 



We now have 



T 



dEj 



N. 



dx dE 2E n 



In 



E -E 

E n +E 



(4.215) 



We see that the temperature is positive for — E < E < and is negative for < E < E . 

What has gone wrong? The answer is that nothing has gone wrong - all our calculations are perfectly correct. This 
system does exhibit the possibility of negative temperature. It is, however, unphysical in that we have neglected 
kinetic degrees of freedom, which result in an entropy function S(E, N s ) which is an increasing function of energy. 
In this system, S(E, N s ) achieves a maximum of S max = N s k B In 2 at E = (i.e. x = |), and then turns over and 
starts decreasing. In fact, our results are completely consistent with eqn. 4.204 : the energy E is an odd function 
of temperature. Positive energy requires negative temperature! Another example of this peculiarity is provided 
in the appendix in §4.11.2. 
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Figure 4.7: When entropy decreases with increasing energy, the temperature is negative. Typically kinetic degrees 
of freedom prevent this peculiarity from manifesting in physical systems. 



4.9.3 Adsorption 

PROBLEM : A surface containing N s adsorption sites is in equilibrium with a monatomic ideal gas. Atoms adsorbed 
on the surface have an energy —A and no kinetic energy. Each adsorption site can accommodate at most one 
atom. Calculate the fraction / of occupied adsorption sites as a function of the gas density n, the temperature T, 
the binding energy A, and physical constants. 

The grand partition function for the surface is 



S S urf = e- n ^ f /k B T = Q + e A/k B T eti /k B T^ s ^ ^ 



The fraction of occupied sites is 

Since the surface is in equilibrium with the gas, its fugacity z = exp(/j, / k B T) and temperature T are the same as in 
the gas. 

SOLUTION : For a monatomic ideal gas, the single particle partition function is ( = V\^ 3 , where X T = ^2Trh 2 /mk B T 
is the thermal wavelength. Thus, the grand partition function, for indistinguishable particles, is 

S gas = cxp(VA T 3 e ^B^ . (4.218) 

The gas density is 

n = = -i = Ay 3 e"/*B T . (4.219) 
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We can now solve for the fugacity: z = e^l k B T = nXj,. Thus, the fraction of occupied adsorption sites is 

n\9p + e ^^b 1 
Interestingly, the solution for / involves the constant h. 

It is always advisable to check that the solution makes sense in various limits. First of all, if the gas density tends 
to zero at fixed T and A, we have / — ► 0. On the other hand, if n — ► oo we have / — ► 1, which also makes sense. At 
fixed n and T, if the adsorption energy is (—A) — > — oo, then once again / = 1 since every adsorption site wants to 
be occupied. Conversely, taking (—A) — ► +oo results in n — > 0, since the energetic cost of adsorption is infinitely 
high. 



4.9.4 Elasticity of wool 

Wool consists of interlocking protein molecules which can stretch into an elongated configuration, but reversibly 
so. This feature gives wool its very useful elasticity. Let us model a chain of these proteins by assuming they can 
exist in one of two states, which we will call A and B, with energies e A and e B and lengths £ A and £ B . The situation 
is depicted in fig. 4.8. We model these conformational degrees of freedom by a spin variable a = ±1 for each 
molecule, where a = +1 in the A state and a = —1 in the B state. Suppose a chain consisting of TV monomers is 
placed under a tension r. We then have 



H = 



N 



Similarly the length is 



i=i 



A' 



^ = E[i(^ + 4) + l(4-4)^ 

3=1 

The Gibbs partition function is Y = Tr e~ K l k B T ; with K = H — tL : 

N 



(4.221) 



(4.222) 



K 



(4.223) 



where e A = e A — t£ a and e B = e B — t£ b . At r = the A state is preferred for each monomer, but when r exceeds 
t* , defined by the relation e A = e B , the B state is preferred. One finds 



(4.224) 



A BAB BAAABAA 



Figure 4.8: The monomers in wool are modeled as existing in one of two states. The low energy undeformed state 
is A, and the higher energy deformed state is B. Applying tension induces more monomers to enter the B state. 
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i i 1 r 




Tension tAI/Ae 




2 3 
Temperature k B T/Ac 

Figure 4.9: Upper panel: length L(r, T) for k B T/e = 0.01 (blue), 0.1 (green), 0.5 (dark red), and 1.0 (red). Bottom 
panel: dimensionless force constant k/N(A£) 2 versus temperature. 



Once again, we have a set of N noninteracting spins. The partition function is Y = ( N , where C is the single 
monomer partition function, 

C = Tr e 

where 



-fit* = e ~fi g A + e-^B 



is the single spin Hamiltonian. It is convenient to define the differences 

Ae = e B -e A , A£ = e B -£ A , Ae = e B -e A 
in which case the partition function Y is 

Y(T,t,N) = e~ N ^ 



1 + e 



N 



G(T, r, N) = Ne A - Nk B T\n 



1 + e 



-Ae/fc B T 



The average length is 



L=(L) = - 



dr ), 



T.N 



N£ A 



NA£ 



e (Ae-rAi)/k B T + j 



(4.225) 
(4.226) 



(4.227) 

(4.228) 
(4.229) 



(4.230) 
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The polymer behaves as a spring, and for small r the spring constant is 



k ~ dL 



T = 



4fc B T 

N(Aty 



cosh 



Ae 
2fc D T 



(4.231) 



The results are shown in fig. 4.9. Note that length increases with temperature for r < t* and decreases with 
temperature for r > r* . Note also that k diverges at both low and high temperatures. At low T, the energy gap 
Ae dominates and L = N£ A , while at high temperatures k B T dominates and L = ^N(£ A + £ B ). 



4.9.5 Noninteracting spin dimers 



Consider a system of noninteracting spin dimers as depicted in fig. 4.10. Each dimer contains two spins, and is 
described by the Hamiltonian 



(4.232) 



Here, J is an interaction energy between the spins which comprise the dimer. If J > the interaction is ferromagnetic, 
which prefers that the spins are aligned. That is, the lowest energy states are Iff) and \.[[). If J < the interaction 
is antiferromagnetic, which prefers that spins be anti-aligned: | f f ) and | If ). 6 

Suppose there are N d dimers. Then the OCE partition function is Z = £ where £(T, if) is the single dimer 
partition function. To obtain £(T, H), we sum over the four possible states of the two spins, obtaining 

C = Tre~^ dim ™/ fcBT 

= 2 e~ J / k * T + 2 e J l k * T cosh ( . 

Thus, the free energy is 

F(T,H,N d ) 
The magnetization is 

M = 



-7V d fc B rin2-iV d A: B Tlii 



-J/k B T + e J/k B T CQsh 



KT 



(4.233) 



dHJ, 



2N 



e J/k B T S inh(^) 

-J/fe B T + e J/k B T CQsh ( 



It is instructive to consider the zero field isothermal susceptibility per spin, 



1 dM 



2N d dH 



H=0 



Hi 

k„T ' e J / fc B T 



2 e J / fc B T 



-J/k B T 



(4.234) 



(4.235) 



The quantity u^/k B T is simply the Curie susceptibility for noninteracting classical spins. Note that we correctly 
recover the Curie result when J = 0, since then the individual spins comprising each dimer are in fact noninter- 
acting. For the ferromagnetic case, if J ^> k B T, then we obtain 



X T (J» k B T) 



M 

KT 



(4.236) 



This has the following simple interpretation. When J ^> k B T, the spins of each dimer are effectively locked in 
parallel. Thus, each dimer has an effective magnetic moment fx eS = 2/x . On the other hand, there are only half as 
many dimers as there are spins, so the resulting Curie susceptibility per spin is \ x (2/iq) 2 /k B T. 



6 Nota bene we are concerned with classical spin configurations only - there is no superposition of states allowed in this model! 
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Figure 4.10: A model of noninteracting spin dimers on a lattice. Each red dot represents a classical spin for which 
Uj = ±1. 

When — J 3> k B T, the spins of each dimer are effectively locked in one of the two antiparallel configurations. We 
then have 



X T (-J> k B T) 



2 Vo „- 



2\J\/k B T 



(4.237) 



In this case, the individual dimers have essentially zero magnetic moment. 



4.10 Statistical Mechanics of Molecular Gases 



4.10.1 Separation of translational and internal degrees of freedom 

The states of a noninteracting atom or molecule are labeled by its total momentum p and its internal quantum 
numbers, which we will simply write with a collective index a, specifying rotational, vibrational, and electronic 
degrees of freedom. The single particle Hamiltonian is then 



with 



The partition function is 



h\k , a ) 



Pl 4 

2m 

h 2 k 2 

2m 



a | ") 



C = Tr e-P~ h = e~* 2 / 2m £ 



9i e 



(4.238) 
(4.239) 
(4.240) 



Here we have replaced the internal label a with a label j of energy eigenvalues, with being the degeneracy of 
the internal state with energy e •. To do the p sum, we quantize in a box of dimensions L 1 x L 2 X • • • x L d , using 
periodic boundary conditions. Then 
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where each n i is an integer. Since the differences between neighboring quantized p vectors are very tiny, we can 
replace the sum over p by an integral: 



E 



A Pl ■ ■ ■ Ap d 



(4.242) 



where the volume in momentum space of an elementary rectangle is 

(2irH) d _ (2nh) d 



Thus, 



Ap v --A Pd = 



v 



{2irh) d 



(4.243) 



(4.244) 
(4.245) 



Here, £(T) is the internal coordinate partition function. The full TV-particle ordinary canonical partition function is 
then 

tN/m (4.246) 



Nl \X 

Using Stirling's approximation, we find the Helmholtz free energy F = — k s T In Z is 



F(T, V, N) = -Nk B T 



= -NKT 





y NX d ) 






ln( 


,NX d ) 







l + Inf(T) 
N<p(T) 



1 



(4.247) 



where 



<p(T) = -k B T]n£(T) 



(4.248) 



is the internal coordinate contribution to the single particle free energy. We could also compute the partition 
function in the Gibbs (T, p, N) ensemble: 



Y(T, p, N) = e-^^P^) = ^[dV e-P pV Z{T, V, N) 



Vn 



N 



(4.249) 



Thus, in the thermodynamic limit, 



N 



k„Thi 



fc B T 
KT 



KTln£(T) 



(4.250) 
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4.10.2 Ideal gas law 

Since the internal coordinate contribution to the free energy is volume-independent, we have 

V-m =«, (4-251) 

V d P Jt,n P 



and the ideal gas law applies. The entropy is 



1^1 



P,N 



\p\ d T ) 



ln[ ^ ] +1 + \d 



-Ntp'{T), (4.252) 



and therefore the heat capacity is 



C v= T [^f) n = i¥+ l ) N K NT ip"(T) (4.253) 



°v = T ( ||") = \dNk n - NT^"(T) . (4.254) 



BT I 2 

, U1 /v,n 

Thus, any temperature variation in C p must be due to the internal degrees of freedom. 

4.10.3 The internal coordinate partition function 

At energy scales of interest we can separate the internal degrees of freedom into distinct classes, writing 

h int = Kot + Kib + Klec (4.255) 

as a sum over internal Hamiltonians governing rotational, vibrational, and electronic degrees of freedom. Then 

£int = £rot ' £vib ' £clcc ■ (4.256) 

Associated with each class of excitation is a characteristic temperature 0. Rotational and vibrational temperatures 
of a few common molecules are listed in table tab. 4.1. 

4.10.4 Rotations 

Consider a class of molecules which can be approximated as an axisymmetric top. The rotational Hamiltonian is 
then 

L 2 + Ll L 2 



h„ 

(4.257) 



rot 2L 2/, 



h 2 L(L + l) ( 1 1 



2I X \2I 3 2/j 



L 2 

c ' 



where n a b (t) are the principal axes, with h c the symmetry axis, and L a b c are the components of the angular 
momentum vector L about these instantaneous body-fixed principal axes. The components of L along space-fixed 
axes {x, y, z} are written as L x > v > z . Note that 

[J/ , L c ] = < , V] + [L» , <] V = ie^ < L x + ie^ vX n x c V = , (4.258) 
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molecule 


©rotCK) 


vib (K) 


H 2 


85.4 


6100 


N 2 


2.86 


3340 


H 2 


13.7,21.0,39.4 


2290 , 5180 , 5400 



Table 4.1: Some rotational and vibrational temperatures of common molecules. 



which is equivalent to the statement that L c = n c ■ L is a rotational scalar. We can therefore simultaneously specify 
the eigenvalues of {L 2 , L z , L c }, which form a complete set of commuting observables (CSCO) 7 . The eigenvalues 
of L z are mh with m £ {— L, . . . , L}, while those of L c are kh with k g {— L, . . . , L}. There is a (2L + l)-fold 
degeneracy associated with the L z quantum number. 

We assume the molecule is prolate, so that I 3 < I 1 . We can the define two temperature scales, 

Prolateness then means > 0. We conclude that the rotational partition function for an axisymmetric molecule 
is given by 

oo L 

&otCn = E( 2L + !) e - i(i+i)e/T £; e - fc2 (§ - e)/T (4.260) 

L=0 k=-L 

In diatomic molecules, I 3 is extremely small, and ^ /c B T at all relevant temperatures. Only the k = term 
contributes to the partition sum, and we have 

4 ot (T) = £(2L+l)e^ 1 )^. (4.261) 

L=0 

When T <C 0, only the first few terms contribute, and 

4 ot (T) = 1 + 3 e- 20 / T + 5 e~ e0 / T + ... (4.262) 

In the high temperature limit, we have a slowly varying summand. The Euler-MacLaurin summation formula may 
be used to evaluate such a series: 



iv - OO 

J2 F k = J dkF ( k ) + h[F(0) + F(n)] + £ 7^7 [F {23 ~ 1] {n) - F^(0) 



(4.263) 



- (2j)! 

where is the j th Bernoulli number where 

B o = 1 , B 1 = -5 , ^2 = 6 ' ^4 = _ 30 i ^6 = 42 • (4.264) 

Thus, 

oo 

oo 

E *k = / da: + |F(0) - ^' (0) - — F'" (0) + . . . . (4.265) 

1 n " 



fc=0 



7 Note that while we cannot simultaneously specify the eigenvalues of two components of L along axes fixed in space, we can simulta- 
neously specify the components of L along one axis fixed in space and one axis rotating with a body. See Landau and Lifshitz, Quantum 
Mechanics, §103. 
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We have F(x) = (2x + 1) e - x( - x +V e / T , f or w hi c h /da; F(x) = § , hence 

o 

T 1 10 4 /<9 x2 



- ' 3 + 15T + 3lHrJ + (4266) 

Recall that ^(T) = -k B T]n£(T). We conclude that y rot (T) w -3k B T e~ 20 / T for T < 6>and^ rot (T) w -fc B Tln(T/<9) 
for T ^> 0. We have seen that the internal coordinate contribution to the heat capacity is AC V = —NT(p"{T). For 
diatomic molecules, then, this contribution is exponentially suppressed for T <C 6*, while for high temperatures 
we have AC y = Nk B . One says that the rotational excitations are 'frozen out' at temperatures much below O. 
Including the first few terms, we have 

AC V (T « 0) = 12 Nk B (^j e -20/T + _ _ ( 4 267) 

1 f 9\ 2 16 /6> x3 



A^(T»0)=iV fcB |l + -^-J + -^-j + ...|. (4.268) 

Note that C v overshoots its limiting value of Nk B and asymptotically approaches it from above. 

Special care must be taken in the case of homonuclear diatomic molecules, for then only even or odd L states are 
allowed, depending on the total nuclear spin. This is discussed below in §4.10.7. 

For polyatomic molecules, the moments of inertia generally are large enough that the molecule's rotations can be 
considered classically. We then have 

e( L„L b ,LJ = J| + i| + Ji. (4269, 



We then have 

1 f dL a dl h dL c d^d9dtp r _ e(u u L c )/fc B T 

<?rot J M 



£ rot(T ) = — / \ZlZ Lb LJ/fcBT . ( 4 - 27 0) 



where (cj),9ip) are the Euler angles. Recall (j> £ [0, 2ir], 9 £ [0, 7r], and t/j £ [0,2tv\. The factor g rot accounts for 
physically indistinguishable orientations of the molecule brought about by rotations, which can happen when 
more than one of the nuclei is the same. We then have 



[2k B T 



3/2 



4ot(T)=(^H V^Ws- (4-271) 



This leads to AC V = %Nk B 



4.10.5 Vibrations 



Vibrational frequencies are often given in units of inverse wavelength, such as cm 1 , called a wavenumber. To 
convert to a temperature scale T* , we write k B T* = hv = hc/X, hence T* = (hc/k B ) A -1 , and we multiply by 

he 

— = 1.436 K- cm. (4.272) 

B 

For example, infrared absorption (~ 50 cm -1 to 10 4 cm -1 ) reveals that the 'asymmetric stretch' mode of the H 2 
molecule has a vibrational frequency of v = 3756 cm -1 . The corresponding temperature scale is T* = 5394 K. 
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Vibrations are normal modes of oscillations. A single normal mode Hamiltonian is of the form 

h = — + \muo 2 q 2 = hu(a!a + i) . (4.273) 

In general there are many vibrational modes, hence many normal mode frequencies oj a . We then must sum over 
all of them, resulting in 

£vib = rUvfb- (4-274) 

a 

For each such normal mode, the contribution is 



oo 



£ = ^2 e -(™+5)^/ fc B^ = e -tuo/2k B T ^2 

n=a 71=0 
e -hoj/2k B T i 



-hu/k-gT 



l _ e -nu/k B T 2 sinh(6> /2T) 



where = hu;/k B . Then 



cp = fc B Tln f2sinh(<9/2T)) 
= \k B + fc B Tln(l - e- ' T ) 

The contribution to the heat capacity is 

'0\ 2 e e ' T 

ACV = Nk 1 



B \T J (e e l T -l) 2 

' Nk B (0/T) 2 exp(-0/r) (T -> 0) 
Nk B (T -► oo) 



(4.275) 



(4.276) 



(4.277) 



4.10.6 Two-level systems : Schottky anomaly 

Consider now a two-level system, with energies e and e 1 . We define A = e 1 — e and assume without loss of 
generality that A > 0. The partition function is 

C = e- 0s ° + e"^ = e _/,e ° (l + e~ pA ) . (4.278) 

The free energy is 

/ = -k B T In ( = e - k B T In (l + e - A / fe B T ) . (4.279) 
The entropy for a given two level system is then 

• = -% = fc B m (1 + e" W) + £ . (4.280) 



and the heat capacity is = T (ds/dT), i.e. 



A 2 e A/k B T 

C ( T ) = T7^ 2 ■ ( 4 -281) 
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Thus, 



c (T « A) = ^2 e~ A/kBT (4.282) 



2 



c(T»A) = ^. (4.283) 

We find that c(T) has a characteristic peak at T* « 0.42 A/fc B . The heat capacity vanishes in both the low tem- 
perature and high temperature limits. At low temperatures, the gap to the excited state is much greater than k B T, 
and it is not possible to populate it and store energy. At high temperatures, both ground state and excited state 
are equally populated, and once again there is no way to store energy. 

If we have a distribution of independent two-level systems, the heat capacity of such a system is a sum over the 
individual Schottky functions: 

oo 

C(T) = 5>(A ?; /fc B T) = N jdA P(A) c(A/T) , (4.284) 

o 

where N is the number of two level systems, c(x) = k B x 2 e x / (e x + l) 2 , and where P( A) is the normalized distri- 
bution function, which satisfies the normalization condition 

dA P(A) = 1 . (4.285) 

N s is the total number of two level systems. If P(A) cx A r for A — > 0, then the low temperature heat capacity 
behaves as C(T) oc T 1+r . Many amorphous or glassy systems contain such a distribution of two level systems, 
with r ~ for glasses, leading to a linear low-temperature heat capacity. The origin of these two-level sys- 
tems is not always so clear but is generally believed to be associated with local atomic configurations for which 
there are two low-lying states which are close in energy. The paradigmatic example is the mixed crystalline solid 
(KBr) 1 _ a .(KCN) a . which over the range 0.1 < x < 0.6 forms an 'orientational glass' at low temperatures. The two 
level systems are associated with different orientation of the cyanide (CN) dipoles. 
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4.10.7 Electronic and nuclear excitations 



For a monatomic gas, the internal coordinate partition function arises due to electronic and nuclear degrees of 
freedom. Let's first consider the electronic degrees of freedom. We assume that k B T is small compared with 
energy differences between successive electronic shells. The atomic ground state is then computed by filling up 
the hydrogenic orbitals until all the electrons are used up. If the atomic number is a 'magic number' (A = 2 (He), 
10 (Ne), 18 (Ar), 36 (Kr), 54 (Xe), etc.) then the atom has all shells filled and L = and S = 0. Otherwise the last 
shell is partially filled and one or both of L and S will be nonzero. The atomic ground state configuration 2,7+1 L s 
is then determined by Hund's rules: 

1. The LS multiplet with the largest S has the lowest energy. 

2. If the largest value of S is associated with several multiplets, the multiplet with the largest L has the lowest 
energy. 

3. If an incomplete shell is not more than half- filled, then the lowest energy state has J = \L — S\. If the shell is 
more than half-filled, then J = L + S. 



The last of Hund's rules distinguishes between the (25 + 1)(2£ + 1) states which result upon fixing S and L as 
per rules #1 and #2. It arises due to the atomic spin-orbit coupling, whose effective Hamiltonian may be written 
H = AL ■ S, where A is the Russell-Saunders coupling. If the last shell is less than or equal to half-filled, then 
A > and the ground state has J = \L — S\. If the last shell is more than half -filled, the coupling is inverted, i.e. 
A < 0, and the ground state has J = L + S. 8 



The electronic contribution to £ is then 



clcc 



J2 (2J+l)e- A£ ( i < s ' J >/ fc B T (4.286) 



J=\L-S\ 



where 

Ae(L, S, J) = \A J (J + 1) - L(L + 1) - 5(5 + 1) 



(4.287) 



At high temperatures, k B T is larger than the energy difference between the different J multiplets, and we have 
£eiec ~ (2-^ + 1)(25 + 1) e~P e °, where e Q is the ground state energy. At low temperatures, a particular value of 
J is selected - that determined by Hund's third rule - and we have £ oloc ~ (2 J + 1) e _/3e o. ]f r in addition, there 
is a nonzero nuclear spin /, then we also must include a factor £ nuc = (21 +1), neglecting the small hyperfine 
splittings due to the coupling of nuclear and electronic angular momenta. 

For heteronuclear diatomic molecules, i.e. molecules composed from two different atomic nuclei, the internal par- 
tition function simply receives a factor of £ olcc ■ £nuc ■ £nuc, where the first term is a sum over molecular electronic 
states, and the second two terms arise from the spin degeneracies of the two nuclei. For homonuclear diatomic 
molecules, the exchange of nuclear centers is a symmetry operation, and does not represent a distinct quantum 
state. To correctly count the electronic states, we first assume that the total electronic spin is 5 = 0. This is gen- 
erally a very safe assumption. Exchange symmetry now puts restrictions on the possible values of the molecular 
angular momentum L, depending on the total nuclear angular momentum I tot . If 7 tot is even, then the molecular 
angular momentum L must also be even. If the total nuclear angular momentum is odd, then L must be odd. This 
is so because the molecular ground state configuration is 1 ^'^"- 9 

The total number of nuclear states for the molecule is (21 + l) 2 , of which some are even under nuclear exchange, 
and some are odd. The number of even states, corresponding to even total nuclear angular momentum is written 



See e.g. §72 of Landau and Lifshitz, Quantum Mechanics, which, in my humble estimation, is the greatest physics book ever written. 
'See Landau and Lifshitz, Quantum Mechanics, §86. 
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21 


9g 


9u 


odd 


1(21+ 1) 


(J+1)(2J+1) 


even 


(/+1)(2/+1) 


1(2/ +1) 



Table 4.2: Number of even (g g ) and odd (g u ) total nuclear angular momentum states for a homonuclear diatomic 
molecule. I is the ground state nuclear spin. 



as g g , where the subscript conventionally stands for the (mercifully short) German word gerade, meaning 'even'. 
The number of odd (Ger. ungerade) states is written g u . Table 4.2 gives the values of g g u corresponding to half- 
odd-integer I and integer I. 

The final answer for the rotational component of the internal molecular partition function is then 

Ut(T)=g g C g +g u ( u , (4.288) 

where 

( g = (2£ + l)e-^ i+1 > e -/ T 

L even (4.289) 

c„= J2( 2L+1 ^" L(L+1)0 " JT - 

L odd 

For hydrogen, the molecules with the larger nuclear statistical weight are called orthohydrogen and those with the 
smaller statistical weight are called parahydrogen. For H 2 , we have I = \ hence the ortho state has g u = 3 and the 
para state has g g = 1. In D 2 , we have 1 = 1 and the ortho state has g g = 6 while the para state has g u = 3. In 
equilibrium, the ratio of ortho to para states is then 



iV H 2 _ 9u Ca _ 3 Cu iV D 2 _ 9g Cg _ 2 Cg 

9g(g Cg 9u Cu Cu 



(4.290) 



Incidentally, how do we derive the results in Tab. 4.10.7 ? The total nuclear angular momentum 7 tot is the quan- 
tum mechanical sum of the two individual nuclear angular momenta, each of which are of magnitude /. From 
elementary addition of angular momenta, we have 

/g)7 = 0el©2©---©2/. (4.291) 

The right hand side of the above equation lists all the possible multiplets. Thus, I tot E {0, 1, ... , 21}. Now let us 
count the total number of states with even I tot . If 21 is even, which is to say if I is an integer, we have 



i 

(27— even) _ 

"a 

n=0 



£{2.(2n) + l} = (/ + l)(2/+l), (4.292) 



because the degeneracy of each multiplet is 2I tot + 1. It follows that 

ff (2/=even) = + ^2 _ ^ = j^j + y (4 293) 

On the other hand, if 21 is odd, which is to say I is a half odd integer, then 

M 

{2 • (2n) + l} = 1(21 + 1) . (4.294) 



(2/=odd) 
'9 

n=0 



It follows that 

ff ( 2 /=odd) = (2/ + 1)8 _ „ = (J + i)( 2 J + 1) . (4.295) 
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4.11 Appendix I : Additional Examples 
4.11.1 Three state system 

Consider a spin-1 particle where a = —1,0, +1. We model this with the single particle Hamiltonian 

h = -/i Fcr + A(l-cr 2 ) . (4.296) 

We can also interpret this as describing a spin if a = ±1 and a vacancy if a = 0. The parameter A then represents 
the vacancy formation energy. The single particle partition function is 

C = Tr er ph = e~^ A + 2 cosh(f3^ H) . (4.297) 
With N s distinguishable noninteracting spins (e.g. at different sites in a crystalline lattice), we have Z = £ N * and 

F = NJ = -k B T In Z = —N s k B T In [e" /3A + 2 cosh(/3/i F)l , (4.298) 

where / = — k B T In £ is the free energy of a single particle. Note that 

F)h 

n v = l-a 2 = — (4.299) 

m = fi a=-— (4.300) 
are the vacancy number and magnetization, respectively. Thus, 

n v = (K)=Q K = e _ A/feBT + 2 cosh(MoH/fcBT) < 4 - 301 ) 

and 

m= / rfl \ = _^/ = 2 Mo S inh( Mo g/fc B T) 



At weak fields we can compute 

dm 
Xt =8H 



j4_ 2 

h=o KT 2 + 



(4.303) 



We thus obtain a modified Curie law. At temperatures T <C A/fc B , the vacancies are frozen out and we recover the 
usual Curie behavior. At high temperatures, where T 3> A/fc B , the low temperature result is reduced by a factor 
of |, which accounts for the fact that one third of the time the particle is in a nonmagnetic state with a = 0. 



4.11.2 Spins and vacancies on a surface 

PROBLEM: A collection of spin-1 particles is confined to a surface with N sites. For each site, let a = if there 
is a vacancy, a = + 1 if there is particle present with spin up, and a = — 1 if there is a particle present with spin 
down. The particles are non-interacting, and the energy for each site is given by e = —Wa 2 , where — W < is the 
binding energy. 

(a) Let Q = + be the number of spins, and N Q be the number of vacancies. The surface magnetization is 
M = N+ — N, . Compute, in the microcanonical ensemble, the statistical entropy S(Q, M). 
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(b) Let q = Q/N and m = M/N be the dimensionless particle density and magnetization density, respectively. 
Assuming that we are in the thermodynamic limit, where N, Q, and M all tend to infinity, but with q and m 
finite, Find the temperature T(q, to). Recall Stirling's formula 

ln(iV!) = N\nN - N + 0(\nN) . 

(c) Show explicitly that T can be negative for this system. What does negative T mean? What physical degrees 
of freedom have been left out that would avoid this strange property? 



SOLUTION : There is a constraint on N v N , and N^. 

N^ + N + N l = Q + N = N 
The total energy of the system is E = — WQ. 



(a) The number of states available to the system is 

n 



N^NqIN^. 



Fixing Q and M, along with the above constraint, is enough to completely determine {N^, N , N,}: 
7V T = |(Q + M) , N = N-Q , N l = \{Q-M) , 

Nl 

0(0, M) 



whence 



[1{Q + M)]\[\{Q-M)]\{N-Q)\ • 
The statistical entropy is S = fc B In fi: 

S(Q, M) = k B In(JV!) - fc B In [|(Q + M)!] - k B In [\{Q - A/)!] - k B In [(N - Q)!] 
(b) Now we invoke Stirling's rule, 



to obtain 



ln(iV!) = iV In N - N + C(ln N) , 

lnfi(Q, M) = NkiN-N-±(Q + M) In [\{Q + M)] + \{Q + M) 
- i(Q - AO In [\{Q - M)] + |(Q - AO 
- (N -Q) \n(N - Q) + (N -Q) 



N\nN- igin 



1 (Q 2 - A/ 2 



2 VQ-^ 



Combining terms, 



lnfi(Q,M) = -JVgln W? 2 - m 2 



- iJVmlnf ) - iV(l - q) ln(l - 9 ) 

g — to 



(4.304) 



(4.305) 

(4.306) 
(4.307) 

(4.308) 
(4.309) 



(4.310) 



(4.311) 



where Q = Nq and AI = Nm. Note that the entropy S = fc B InO is extensive. The statistical entropy per site 
is thus 



s(q, to) = — fc B gin 



2 '"B 



g — to 



- ifc„ to In ( ) - fc B (1 - q) ln(l - q) 



(4.312) 
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The temperature is obtained from the relation 



T ~ \dEJ M ~ W \dq~ 



= ^- ln(l - q) — ^ In \\j q 2 - m 2 



1 

W 



(4.313) 



Thus, 



T 



W/k E 



In [2(1 -q)/yjq 2 -m 2 



(4.314) 



(c) We have < q < 1 and — g < m < q, so T is real (thank heavens!). But it is easy to choose {q, m} such that 
T < 0. For example, when m — we have T = W/k B ln(2g _1 — 2) and T < for all g€ (§>!]• The reason for 
this strange state of affairs is that the entropy S is bounded, and is not an monotonically increasing function 
of the energy E (or the dimensionless quantity Q). The entropy is maximized for N 1= N = N, = ^, which 
says m = and q = | . Increasing q beyond this point (with m = fixed) starts to reduce the entropy, and 
hence (dS/dE) < in this range, which immediately gives T < 0. What we've left out are kinetic degrees of 
freedom, such as vibrations and rotations, whose energies are unbounded, and which result in an increasing 
S(E) function. 



4.11.3 Fluctuating interface 

Consider an interface between two dissimilar fluids. In equilibrium, in a uniform gravitational field, the denser 
fluid is on the bottom. Let z = z(x,y)be the height the interface between the fluids, relative to equilibrium. The 
potential energy is a sum of gravitational and surface tension terms, with 

z 

t^grav = Jd 2 xj dz' Ap g z' (4.315) 



C7 surf = Jd 2 x\o{Vz) 2 . (4.316) 

We won't need the kinetic energy in our calculations, but we can include it just for completeness. It isn't so clear 
how to model it a priori so we will assume a rather general form 



T = J&JM 1,( X , X >) «fM . (4.317) 

We assume that the (x, y) plane is a rectangle of dimensions L x x L y . We also assume fi(x, x') — fi\jx — x'\). We 
can then Fourier transform 

z{x) = {L x L y )- l/2 Y J z k z lkx , (4-318) 
k 

where the wavevectors k are quantized according to 

2im 27m„, 

fe = ftp*, x + —JL y , (4.319) 

with integer n x and n y , if we impose periodic boundary conditions (for calculational convenience). The La- 
grangian is then 

L = \Y.Vk\ik\ 2 -(^P + ^ 2 )\z k \ 2 ] , (4.320) 
k 
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where 



H k = d 2 xp(\x\) e 



ikx 



(4.321) 



Since z(x, t) is real, we have the relation z = zt, therefore the Fourier coefficients at k and —k are not indepen- 
dent. The canonical momenta are given by 



dL 

dzi 



Pk - ~QjT — Mfc z k 



The Hamiltonian is then 



H = 



E [ p k z k 



k + Pk z k 



E' 



[gAp + ak 2 ) \z k \ 



(4.322) 

(4.323) 
(4.324) 



where the prime on the k sum indicates that only one of the pair {k, — k} is to be included, for each k. 
We may now compute the ordinary canonical partition function: 



n' ¥ 



(2irh) 2 

2 



3 Iffcl /^feBT e -(a Ap+ak') \z k \ 2 /k B T 



Thus, 



where 10 



2h J \gAp+ak< 



F = -fe B T^ln 



(4.325) 



KT 



gAp + ak" 1 



2\l/2 



(4.326) 



(4.327) 



is the normal mode frequency for surface oscillations at wavevector k. For deep water waves, it is appropriate to 
take p k = Ap/\k\, where Ap = p L — p G « p L is the difference between the densities of water and air. 

It is now easy to compute the thermal average 

(\z k \ 2 ) = jd\ \ Zk \2 e -(9Ap+°k*)\* k \>/k B T I e - (s Ap + afc 2 )|, fc |Vfe B T 



(4.328) 



g Ap + ok 2 

Note that this result does not depend on p k , i.e. on our choice of kinetic energy. One defines the correlation function 



C(x) ee ( z(x) z(0) > = -±- £ < \z k \ 2 ) e*" = ' 

x y k 



(2tt) 2 \gAp + (jk 2 



ikx 



(4.329) 



"Note that there is no prime on the k sum for F, as we have divided the logarithm of Z by two and replaced the half sum by the whole 
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where £ = \fg Ap/er is the correlation length, and where K a (z) is the Bessel function of imaginary argument. The 
asymptotic behavior of i^o(z) for small z is K (z) ~ ln(2/z), whereas for large z one has K (z) ~ (n/2z) 1 ^ 2 e~ z . We 
see that on large length scales the correlations decay exponentially but on small length scales they diverge. This 
divergence is due to the improper energetics we have assigned to short wavelength fluctuations of the interface. 
Roughly, it can cured by imposing a cutoff on the integral, or by insisting that the shortest distance scale is a 
molecular diameter. 



4.11.4 Dissociation of molecular hydrogen 

Consider the reaction 

H p+ + e" . (4.330) 

In equilibrium, we have 

Mh = M P + M c ■ (4.331) 
What is the relationship between the temperature T and the fraction x of hydrogen which is dissociated? 
Let us assume a fraction x of the hydrogen is dissociated. Then the densities of H, p, and e are then 

njj = (1 — x) n , rip = xn , n c — xn . (4.332) 

The single particle partition function for each species is 

C=C(^-) N r- N£ ~ Jk * T > (4-333) 



Nl Va| 

where g is the degeneracy and e int the internal energy for a given species. We have e int = for p and e, and 
£ int = — A for H, where A = e 2 /2a B = 13.6 eV, the binding energy of hydrogen. Neglecting hyperfine splittings 11 , 
we have g H = 4, while g c = g p = 2 because each has spin S = \ . Thus, the associated grand potentials are 

= -9h v KT A t 3 h e ^H+A)/fc B ^ (4334) 

Q p = -g p Vk B T A T 3 p e^ k * T (4.335) 

= -9c Vk B T A T 3 c e^' k * T , (4.336) 

where 



/ 2nh 2 

X T,a = \lzn7^ (4-337) 



m a k B T 



for species a. The corresponding number densities are 

= s A^e ( "- e »' )/fc B T , (4.338) 



1 f^T] 



T.V 



and the fugacity z = e^/ fc B T D f a given species is given by 



z = g 1 n 



Ay e £ ^ /k * T . (4.339) 



11 The hyperfine splitting in hydrogen is on the order of (m c /m p ) a 4 m e c 2 ~ eV, which is on the order of 0.01 K. Here a = e 2 /He is 
the fine structure constant. 
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We now invoke /x H = fi p + fi c , which says z H = z p z e , or 

.9h^h A t , h ^ A/fceT = (<?p \ A T , p ) {g-W Ay , (4.340) 

which yields 

' '-*L} n\% = e - A / k * T , (4.341) 
1 — x J 



where At = \/2nh 2 /m*k- B T, with m* = m p m e /m H f=a m e . Note that 



where a B = 0.529 A is the Bohr radius. Thus, we have 

,3/2 



1 - X 



(4?r)3/2 " = (^) e_T ° /T ' (4343) 



whereT = A/fc B = 1.578 x 10 5 K and v = naf. Consider for example a temperature T = 3000 K, for which T /T = 
52.6, and assume that x = |. We then find v = 1.69 x 10~ 27 , corresponding to a density of n = 1.14 x 10~ 2 cm~ 3 . At 
this temperature, the fraction of hydrogen molecules in their first excited (2s) state is x' ~ e - T o/ 2T = 3.8 x 10~ 12 . 
This is quite striking: half the hydrogen atoms are completely dissociated, which requires an energy of A, yet 
the number in their first excited state, requiring energy 1 A, is twelve orders of magnitude smaller. The student 
should reflect on why this can be the case. 
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Noninteracting Quantum Systems 
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- F. Reif, Fundamentals of Statistical and Thermal Physics (McGraw-Hill, 1987) 
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(Benjamin Cummings, 2000) 
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- D. V. Schroeder, An Introduction to Thermal Physics (Addison- Wesley, 2000) 
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- R. K. Pathria, Statistical Mechanics (2 nd edition, Butterworth-Heinemann, 1996) 

This popular graduate level text contains many detailed derivations which are helpful for the student. 

- M. Plischke and B. Bergersen, Equilibrium Statistical Physics (3 rd edition, World Scientific, 2006) 

An excellent graduate level text. Less insightful than Kardar but still a good modern treatment of the subject. 
Good discussion of mean field theory. 
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This is volume 5 in the famous Landau and Lifshitz Course of Theoretical Physics. Though dated, it still 
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5.2 Statistical Mechanics of Noninteracting Quantum Systems 
5.2.1 Bose and Fermi systems in the grand canonical ensemble 

A noninteracting many-particle quantum Hamiltonian may be written as 

H = J2 £ <*K, (5-1) 

a 

where n a is the number of particles in the quantum state a with energy e a . This form is called the second quantized 
representation of the Hamiltonian. The number eigenbasis is therefore also an energy eigenbasis. Any eigenstate of 
H may be labeled by the integer eigenvalues of the n a number operators, and written as | n 1 , n 2 , . . . ) . We then 
have 

n a \n) =n a \n) (5.2) 

and 

H\n) ^^n a e a \n) . (5.3) 

The eigenvalues n a take on different possible values depending on whether the constituent particles are bosons or 
fermions, viz. 

bosons : n a (={0,1,2,3,...} (5.4) 
fermions : n a £ {0 , l} . (5.5) 

In other words, for bosons, the occupation numbers are nonnegative integers. For fermions, the occupation num- 
bers are either or 1 due to the Pauli principle, which says that at most one fermion can occupy any single particle 
quantum state. There is no Pauli principle for bosons. 

The N -particle partition function Z N is then 

Z N=J2 e ~^ M6 N^ a n^ (5-6) 

where the sum is over all allowed values of the set {n a }, which depends on the statistics of the particles. Bosons 
satisfy Bose-Einstein (BE) statistics, in which n a e {0,1,2,...}. Fermions satisfy Fermi-Dirac (FD) statistics, in 
which n a e {0 , 1}. 

The OCE partition sum is difficult to perform, owing to the constraint J2 a n a — N on the total number of particles. 
This constraint is relaxed in the GCE, where 

N 



E 



=n e^ m "| ( s - 7 ) 

Note that the grand partition function S takes the form of a product over contributions from the individual single 
particle states. 
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We now perform the single particle sums: 

y* e -/3(=-/»)n = L (bosons) (5.8) 

n=0 
1 

e -^(=-M) « = i + e -0(e-/*) (fermions) . (5.9) 



n=0 



Therefore we have 



« BE = fc B r^ln(l - e-^-")/^) (5.11) 



and 



s FD =n( i +^ (e ^ Ai)/fcBT ) ( 5 - 12 ) 

a 

^ FD = -fc B T^ln(l + e -( e <.-")/*B T ) . (5.13) 



We can combine these expressions into one, writing 

fJ(T, V, /i) = ±fc B T ln(l T e - (e °- rf/fc B T ") , (5.14) 



where we take the upper sign for Bose-Einstein statistics and the lower sign for Fermi-Dirac statistics. Note that 
the average occupancy of single particle state a is 

( * J = W a = e^LT i 1 ' (5 " 15) 

and the total particle number is then 

j V(T,y ;M ) = ^ e(Ea _ M)/ 1 fcB (5.16) 
We will henceforth write n a (fi, T) = (n a ) for the thermodynamic average of this occupancy. 

5.2.2 Maxwell-Boltzmann limit 

Note also that if n a (jj,, T)«l then fx -c s a — fc B T, and 

— fl MB = -k B Tj2 e-< e °-»V k » T . (5.17) 

Of 

This is the Maxwell-Boltzmann limit of quantum statistical mechanics. The occupation number average is then 

(n a ) = e -(«.-«)/*BT (5.18) 

in this limit. 
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5.2.3 Single particle density of states 

The single particle density of states per unit volume g(e) is defined as 



V 

We can then write 



0(T,V,fj,) = ±Vk B T Jdeg{e) ln(l T e- {e -^' k ^ T 



(5.20) 



For particles with a dispersion e(k), with p = hk, we have 



^=«/(^* (e " e(fc)) 



(5.21) 



(27r) d de/dfc 

where g = 25+1 is the spin degeneracy. Thus, we have 



5lej (27r) d rfe/dfc i 



* 4£ d= 1 

7r tie 



^fcf d = 2 (5.22) 



s fc 2 # d = 3 . 

In order to obtain g(e) as a function of the energy e one must invert the dispersion relation e = e(k) to obtain 
ft = fc(e). 



Note that we can equivalently write 



ff ( e )£fe = g = 1%^-!* (5.23) 



to derive g(e). 

For a spin-5 particle with ballistic dispersion e(k) = h 2 k 2 /2m, we have 

where 0(e) is the step function, which takes the value for e < and 1 for e > 0. The appearance of 0(e) simply 
says that all the single particle energy eigenvalues are nonnegative. Note that we are assuming a box of volume 
V but we are ignoring the quantization of kinetic energy, and assuming that the difference between successive 
quantized single particle energy eigenvalues is negligible so that g(e) can be replaced by the average in the above 
expression. Note that 

n(g,T,/*)= e(e _ tl)/kB T Tl ■ (5-25) 
This result holds true independent of the form of g(s). The average total number of particles is then 

oo 

N(T, V, M ) = vjdsg(s) e(£ _J kBTTl , (5-26) 

— OO 

which does depend on g(e). 
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5.3 Quantum Ideal Gases : Low Density Expansions 

5.3.1 Expansion in powers of the fugacity 

From eqn. 5.26, we have that the number density n = N/V is 

oo 

n(T,z)= [de — 9 ^ 



e/k B T -p 1 



(5.27) 



where z = exp(/j, / k B T) is the fugacity and 

oo 

Cj(T)= Jdeg(e)e-^ k - T . (5.28) 

— OO 

From Q = —pV and our expression above for i?(T, V, fi), we have 

OO 

p(T, z)=T K T J de 9( £ ) ln (! T z e~ e/k ^ 



(5.29) 



fc B T^(±ir 1 j- 1 q(T) z> 

3 = 1 



5.3.2 Virial expansion of the equation of state 

Eqns. 5.27 and 5.29 express n(T, z) and p(T, z) as power series in the fugacity z, with T-dependent coefficients. In 
principal, we can eliminate z using eqn. 5.27, writing z = z(T, n) as a power series in the number density n, and 
substitute this into eqn. 5.29 to obtain an equation of state p = p(T, n) of the form 

p(T, n) = n k B T (l + B 2 {T) n + B 3 {T) n 2 + . . . \ . (5.30) 

Note that the low density limit n — > yields the ideal gas law independent of the density of states g(e). This 
follows from expanding n (T, z) and p(T, z) to lowest order in z, yielding n = C 1 z+0(z 2 ) andp = k B T C x z+0(z 2 ). 
Dividing the second of these equations by the first yields p = n k B T + 0(n 2 ), which is the ideal gas law. Note that 
z = n/C l + 0(n 2 ) can formally be written as a power series in n. 

Unfortunately, there is no general analytic expression for the virial coefficients Bj(T) in terms of the expansion 
coefficients rij(T). The only way is to grind things out order by order in our expansions. Let's roll up our sleeves 
and see how this is done. We start by formally writing z(T, n) as a power series in the density n with T-dependent 
coefficients A^T): 

z = A x n + A 2 n 2 + A 3 n 3 + . . . . (5.31) 
We then insert this into the series for n(T, z): 

n = C\z±C 2 z 2 + C 3 z 3 + ... (5.32) 
= C\ [A x n + A 2 n 2 + A 3 n 3 + . . . ) ± C 2 (A x n + A 2 n 2 + A 3 n 3 + . . . ) 2 
+ C 3 (A x n + A 2 n 2 + A 3 n 3 + . . . ) 3 + . . . . 
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Let's expand the RHS to order n 3 . Collecting terms, we have 

n = C x A x n + (C 1 A 2 ± C 2 A 2 ) n 2 + (C x A 3 ± 2C 2 A X A 2 + C 3 A 3 )n 3 + ... (5.33) 

In order for this equation to be true we require that the coefficient of n on the RHS be unity, and that the coefficients 
of •)!? for all j > 1 must vanish. Thus, 

C X A X = 1 

C x A 2 ±C 2 A\ = () (5.34) 
C x A 3 ± 2C 2 A X A 2 + C 3 A 3 = . 

The first of these yields A 1 : 

A = ^r- (5-35) 
We now insert this into the second equation to obtain A 2 : 



A 2 = T^- (5.36) 



Next, insert the expressions for Aj and A 2 into the third equation to obtain A s : 

*2Cn Co 



A3 = = cf-c L f (5 ' 37) 



This procedure rapidly gets tedious! 

And we're only half way done. We still must express p in terms of n: 

= C 1 (A x n + A 2 n 2 + A 3 n 3 + . . . ) ± \C 2 [A x n + A 2 n 2 + A 3 n 3 



KT 



+ \C 3 (A x n + A 2 n 2 + A 3 n 3 + . . . ) 3 + . . . 
C x A, n + (C x A 2 ± \C 2 A\)n 2 + (C x A 3 ± C 2 A x A 2 + | C 3 A 3 



(5.38) 



n + B 2 n A + B 3 rC' 



We can now write 



B 2 = C x A 2 ±\C 2 Al = T ^ 2 (5.39) 



B 3 = C x A 3 ± C 2 A, A 2 + i C 3 A 3 = gf - |^ . (5.40) 

It is easy to derive the general result that £?J = (— 1) J ' _1 SJ, where the superscripts denote Fermi (F) or Bose (B) 
statistics. 

We remark that the equation of state for classical (and quantum) interacting systems also can be expanded in terms 
of virial coefficients. Consider, for example, the van der Waals equation of state, 



aN 2 
~V 2 



[V - Nb) = Nk B T . (5.41) 



This may be recast as 



nk B T 2 
p = an 

1 - bn (5.42) 



= nk B T + (b k B T - a) n 2 + k B T b 2 n 3 + k B T b 3 n A + ... , 
where n = N/V. Thus, for the van der Waals system, we have B 2 = (b k B T — a) and B k = k B T b k ^ 1 for all k > 3. 
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5.3.3 Ballistic dispersion 

For the ballistic dispersion e(p) = p 2 /2m we computed the density of states in eqn. 5.24. One finds 

—d 00 

C ^ T) = S dtti - l z- Jt = zs^ d r d/2 . (5.43) 


We then have 

B 2 (T) = T2-(^ +1 )-g s 1 A^ (5.44) 

B 3 (T) = (V^ +1 ) - 3-(^ +1 )) • 2gf X 2 T d . (5.45) 

Note that B 2 (T) is negative for bosons and positive for fermions. This is because bosons have a tendency to bunch 
and under certain circumstances may exhibit a phenomenon known as Bose-Einstein condensation (BEC). Fermions, 
on the other hand, obey the Pauli principle, which results in an extra positive correction to the pressure in the low 
density limit. 

We may also write 

n(T, z) = ±g s X T d Lid (±z) (5.46) 

and 

P (T,z) = ±g s k B TX T d Lu (±z) , (5.47) 



where 

Li ,(^ES ( 5 - 48 ) 



00 

z 



7T 
n— 1 



is the polylogarithm function 1 . Note that h\ q (z) obeys a recursion relation in its index, viz. 

z^U q (z) = U q _ 1 (z), (5.49) 

and that 

00 1 

Li 9 (!) = E ^ = ■ (5-50) 

1 ^ 

n— 1 



5.4 Entropy and Counting States 

Suppose we are to partition N particles among J possible distinct single particle states. How many ways O are 
there of accomplishing this task? The answer depends on the statistics of the particles. If the particles are fermions, 
the answer is easy: fl FD = (^) . For bosons, the number of possible partitions can be evaluated via the following 
argument. Imagine that we line up all the TV particles in a row, and we place J — 1 barriers among the particles, 
as shown below in Fig. 5.1. The number of partitions is then the total number of ways of placing the TV particles 
among these N + J — 1 objects (particles plus barriers), hence we have £! BE = ( N+ ^ For Maxwell-Boltzmann 
statistics, we take fi MB = J /N\. Note that f2 MB is not necessarily an integer, so Maxwell-Boltzmann statistics 
does not represent any actual state counting. Rather, it manifests itself as a common limit of the Bose and Fermi 
distributions, as we have seen and shall see again shortly. 

1 Several texts, such as Pathria and Reichl, write g q (z) for Li 9 (z). I adopt the latter notation since we are already using the symbol g for the 
density of states function 3(e) and for the internal degeneracy g. 
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Figure 5.1: Partitioning TV bosons into J possible states (N = 14 and J = 5 shown). The N black dots represent 
bosons, while the J — 1 white dots represent markers separating the different single particle populations. Here 

ri\ = 3, n-2 = 1, ns = 4, m = 2, and ns = 4. 



The entropy in each case is simply S = k B \nfl. We assume N ^> 1 and J ^> 1, with n = N/J finite. Then using 
Stirling's approximation, \n(K\) = K In K — K + C(ln K), we have 



S wb = -Jk B n\nn 

S BE = —Jk B [nlnn — (1 + n) ln(l + n)] 
S FD = —Jk B [nlnn + (1 — n) ln(l — n)] . 
In the Maxwell-Boltzmann limit, n«l, and all three expressions agree. Note that 



\ dN 



dN 



f dS FD 
\ dN 



-k„ (1 + Inn) 



K lnfn 1 + l] 



= /Llnfn- 1 - 1 



(5.51) 



(5.52) 



Now let's imagine grouping the single particle spectrum into intervals of J consecutive energy states. If J is finite 
and the spectrum is continuous and we are in the thermodynamic limit, then these states will all be degenerate. 
Therefore, using a as a label for the energies, we have that the grand potential J? = E — TS — fiN is given in each 
case by 

a 

fi BE = J Y1 [( £ « ~ V)n a + K Tn a ^ n a - K T ( l + ™JM 1 + n a ) (553) 

a 



a 



Now - lo and behold! - treating J? as a function of the distribution {n a } and extremizing in each case, subject 
to the constraint of total particle number N = JJ2 a n a> one obtains the Maxwell-Boltzmann, Bose-Einstein, and 
Fermi-Dirac distributions, respectively: 



Sn„ 



'n MB = e ^~ 6 °^ fcBT 



be = [ e ( £a -M)A B T_ 1 ]- 1 
n™ = [e^a-^/VT + i]- 1 



(5.54) 



As long as J is finite, so the states in each block all remain at the same energy, the results are independent of J. 
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5.5 Photon Statistics 



5.5.1 Thermodynamics of the photon gas 

There exists a certain class of particles, including photons and certain elementary excitations in solids such as 
phonons (i.e. lattice vibrations) and magnons (i.e. spin waves) which obey bosonic statistics but with zero chemical 
potential. This is because their overall number is not conserved (under typical conditions) - photons can be 
emitted and absorbed by the atoms in the wall of a container, phonon and magnon number is also not conserved 
due to various processes, etc. In such cases, the free energy attains its minimum value with respect to particle 
number when 

'dF\ 



dN ), 



= . (5.55) 

T.V 



The number distribution, from eqn. 5.15, is then 



n(e) = . (5.56) 

The grand partition function for a system of particles with /i = is 

oo 

Q(T, V) = Vk B T J de g(e) In (l - e - e / k ^ T ) , (5.57) 

— oo 

where g(e) is the density of states per unit volume. 

Suppose the particle dispersion is e(p) = ^4|p| cr . We can compute the density of states 17(e): 



9(e) = gj^ S(e - A\pr) 

OO 

= ^Ajdpp d - 1 8(e~A P °) 

° oo (5.58) 

= ^d A -°Jdxxt- 1 5(e-x) 



2 



g ( 



oT(d/2)\ hA ^h 



--1 



e" 6(e) 



where g is the internal degeneracy, due, for example, to different polarization states of the photon. We have used 
the result Q d = 2n d / 2 /T{d/2) for the solid angle in d dimensions. The step function 0(e) is perhaps overly formal, 
but it reminds us that the energy spectrum is bounded from below by e = 0, i.e. there are no negative energy 
states. 

For the photon, we have e(p) = cp, hence a = 1 and 

2s ■K d / 2 e d_1 

^-wmwr Q{£) - {559) 



In d = 3 dimensions the degeneracy is g = 2, the number of independent polarization states. The pressure p(T) is 
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then obtained using fi = —pV. We have 

oo 

p(T) = -k B T Jdeg(e) In (l - e - e/k * T ) 

— OO 

d/2 °° 

2 (hc)- d k B Tjdee d ~ 1 In (l - e^^) 



a 

oo 

cZ-1 



II 



r(d/2) 

2g^ d / 2 (fc B T) d+1 
~ r(d/2) ~[hcj d ~ 

We can make some progress with the dimensionless integral: 

oo 

l d = - Jdtt d - 1 ln(l- e^) 
o 

„ oo 



■3 W<J±T\ ( fc B r ) d+ 1 



The number density is found to be 

oo 

9(e) 



n{T) = Jde 



e e/k B T _ 1 

— OO 



he 



For photons in d = 3 dimensions, we have g = 2 and thus 



It turns out that £(4) = 

Note that ftc//c B = 0.22855 cm • K, so 



(5.60) 



= jri (dtt d - l e- nt (5-61) 

n=l 71 Q 

oo 1 

- r WE^TT- r ( d )^ + l)- 

?7 = 1 

Finally, we invoke a result from the mathematics of the gamma function known as the doubling formula, 

i» = ^r(i)r(^) . (5.62) 

Putting it all together, we find 



p(T) = gv-^ r(*±l) C(d + 1) • (5.63) 



(5.64) 



r7 * 2C(3)/fc B T\ 3 2C(4) (fc B T) 4 



-j— = 4.3755 T[K] enf 1 =s> n(T) = 20.405 x T [K ] cm^ . (5.66) 
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To find the entropy, we use Gibbs-Duhem: 

du = = — s dT + v dp => s = v , (5.67) 

dl 

where s is the entropy per particle and v = n^ 1 is the volume per particle. We then find 

s (T) = (d+l)^±±±k B . (5.68) 
The entropy per particle is constant. The internal energy is 



and hence the energy per particle is 



E = -°^ = -±(fipV) = d.pV, (5.69) 



E , d-C(d+l) , 

s=-=d-pv= y > k B T. (5.70) 



5.5.2 Classical arguments for the photon gas 

A number of thermodynamic properties of the photon gas can be determined from purely classical arguments. 
Here we recapitulate a few important ones. 

1. Suppose our photon gas is confined to a rectangular box of dimensions L x x L y x L z . Suppose further 
that the dimensions are all expanded by a factor A 1 / 3 , i.e. the volume is isotropically expanded by a factor 
of A. The cavity modes of the electromagnetic radiation have quantized wavevectors, even within classical 
electromagnetic theory given by 

fe= (^'^'x")- (5 " 71) 

Since the energy for a given mode is e(k) = Hc\k\, we see that the energy changes by a factor A -1 / 3 under an 
adiabatic volume expansion V — ► XV, where the distribution of different electromagnetic mode occupancies 
remains fixed. Thus, 



Thus, 

P=-(—) =~ 
P \dv) s W ' 

as we found in eqn. 5.69. Since E = E(T, V) is extensive, we must have p = p(T) alone. 
2. Since p = p{T) alone, we have 



(5.73) 



dE\ _ /dE\ 

W) T - [dv) p ~ 



3p 



(5.74) 



where the second line follows the Maxwell relation (§y) = (§r)v' a ^ er invoking the First Law dE = 
TdS-pdV. Thus, 

T lT= Ap ^ P(T)=AT\ (5.75) 
where A is a constant. Thus, we recover the temperature dependence found microscopically in eqn. 5.63. 
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3. Given an energy density E/V, the differential energy flux emitted in a direction 9 relative to a surface normal 
is 

E n dC2 lrn ^ 
dj e = c ■ — ■ cos B ■ — , (5.76) 

V 47T 

where dfl is the differential solid angle. Thus, the power emitted per unit area is 

tt/2 2tt 

Ap „ZP f f r V 

dA^AnV J d0 J d ^ sin0 • cos0 = — = | cp(T) ^*T\ (5.77) 
o o 

where a = \cA, with p(T) = A T A as we found above. From quantum statistical mechanical considerations, 
we have 

(578) 

is Stefan's constant. 



5.5.3 Surface temperature of the earth 

We derived the result P = aT 4 ■ A where a = 5.67 x 1CP 8 W/m 2 K 4 for the power emitted by an electromagnetic 
'black body'. Let's apply this result to the earth-sun system. We'll need three lengths: the radius of the sun 
R Q = 6.96 x 10 8 m, the radius of the earth R c = 6.38 x 10 6 m, and the radius of the earth's orbit a c = 1.50 x 10 11 m. 
Let's assume that the earth has achieved a steady state temperature of T e . We balance the total power incident 
upon the earth with the power radiated by the earth. The power incident upon the earth is 

incident = ^ " ^ ' ^ = ^ ■ KoTq . (5.79) 



The power radiated by the earth is 
Setting P incidcnt = P radiatcd , we obtain 



Radiated = <^e " ■ (5-80) 



T e =(^ T . (5.81) 



Thus, we find T c = 0.04817 T Q , and with T Q = 5780 K, we obtain T c = 278.4 K. The mean surface temperature 
of the earth is T c = 287 K, which is only about 10 K higher. The difference is due to the fact that the earth is not 
a perfect blackbody, i.e. an object which absorbs all incident radiation upon it and emits radiation according to 
Stefan's law. As you know, the earth's atmosphere retraps a fraction of the emitted radiation - a phenomenon 
known as the greenhouse effect. 



5.5.4 Distribution of blackbody radiation 

Recall that the frequency of an electromagnetic wave of wavevector k is v = c/A = ck/2n. Therefore the number 
of photons N T {v, T) per unit frequency in thermodynamic equilibrium is (recall there are two polarization states) 

_2V d 3 k _ V_ k 2 dk 

Al [y, T) dv - — ■ ^ hck/k ^ T _ ^ - ■ ehc k/k B T _ i ■ (5.82) 

We therefore have 

W">T)=*^-. efw/ £ T _ i . (5.83) 
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frequency v [10 14 Hz] 
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Figure 5.2: Spectral density p e (u, T) for blackbody radiation at three temperatures. 

Since a photon of frequency v carries energy hv, the energy per unit frequency £(v) is 

BTrhV v 3 



£{v,T) 



£.3 ^hv fk u T -y 



(5.84) 



Note what happens if Planck's constant h vanishes, as it does in the classical limit. The denominator can then be 
written 

hv 



hu/k B T 



b j - 1 



and 



e oL (u,T) = pmE(v)=V 



Oth 2 



8irk„T 



(5.85) 



(5.86) 



In classical electromagnetic theory then, the total energy integrated over all frequencies diverges. This is known 
as the ultraviolet catastrophe, since the divergence comes from the large v part of the integral, which in the optical 
spectrum is the ultraviolet portion. With quantization, the Bose-Einstein factor imposes an effective ultraviolet 
cutoff k B T/h on the frequency integral, and the total energy, as we found above, is finite: 



oo 

E(T) = Jdv £{v) = 3pV = V- y 



(fc B T) 4 
15 {he) 3 



We can define the spectral density p e iy) of the radiation as 



£(v, T) 15 h {hv/KTf 



(5.87) 



(5.88) 



E{T) 7T 4 k B T e ^/k B T _ 1 
so that p e (u, T) dv is the fraction of the electromagnetic energy, under equilibrium conditions, between frequencies 

oo 

v and v + dv, i.e. J dv p e (v, T) = 1. In fig. 5.2 we plot this in fig. 5.2 for three different temperatures. The maximum 

o 

occurs when s = hv/k B T satisfies 



ds \ e s — 1 



1 - e- 



s = 2.82144 



(5.89) 
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5.5.5 What if the sun emitted ferromagnetic spin waves? 

We saw in eqn. 5.76 that the power emitted per unit surface area by a blackbody is crT 4 . The power law here 
follows from the ultrarelativistic dispersion e = Tick of the photons. Suppose that we replace this dispersion with 
the general form e = e(k). Now consider a large box in equilibrium at temperature T. The energy current incident 
on a differential area dA of surface normal to z is 

dp = dA 7(^3 <»<«•) ■ 4» ■ \^ ■ Tiw^brr ■ < 5 - 9 °> 

Let us assume an isotropic power law dispersion of the form e(k) = Ck a . Then after a straightforward calculation 
we obtain 

where 

CT = C(2 + f)r(2 + f).^L^i. (5.92) 
One can check that for g = 2, C = he, and a = 1 that this result reduces to that of eqn. 5.78. 



5.6 Lattice Vibrations : Einstein and Debye Models 

Crystalline solids support propagating waves called phonons, which are quantized vibrations of the lattice. Recall 
that the quantum mechanical Hamiltonian for a single harmonic oscillator, H = ^— + ^muj^q 2 , may be written as 
H = hu} {a)a + \), where a and a) are 'ladder operators' satisfying commutation relations [a , aJ\ = 1. 



5.6.1 One-dimensional chain 



Consider the linear chain of masses and springs depicted in fig. 5.3. We assume that our system consists of N mass 
points on a large ring of circumference L. In equilibrium, the masses are spaced evenly by a distance b = L/N. 
That is, = nb is the equilibrium position of particle n. We define u n = x n - 
position of mass n and The Hamiltonian is then 



to be the difference between the 



H 



E 



E 



2m + 



2m 



a? 



(5.93) 



Nn(b-a)' 2 



where a is the unstretched length of each spring, m is the mass of each mass point, k is the force constant of each 
spring, and TV is the total number of mass points, lib ^ a the springs are under tension in equilibrium, but as we 
see this only leads to an additive constant in the Hamiltonian, and hence does not enter the equations of motion. 

The classical equations of motion are 



Pn 



9H_ = Pn 

dp n m 
dH 
du„, 



K (u n+1 + u n _ x - 2u n ) 



(5.94) 
(5.95) 
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Figure 5.3: A linear chain of masses and springs. The black circles represent the equilibrium positions of the 
masses. The displacement of mass n relative to its equilibrium value is u n . 



Taking the time derivative of the first equation and substituting into the second yields 



m 

We now write 



u„ = — (w„+i + «„-i - 2u„) • (5.96) 



l=J2^e ikna , (5.97) 



N k 



where periodicity u N+n = u n requires that the k values are quantized so that e lkNa = 1, i.e. k = 2nj /Na where 
j € {0,1, . . . , N— 1}. The inverse of this discrete Fourier transform is 

^ = -^Y, U n^ kna ■ (5.98) 



Note that ii k is in general complex, but that u* k = u_ k . In terms of the u k , the equations of motion take the form 



* TO 



2k 

— (1 - cos(fca)) u k = -Lu 2 k u k . (5.99) 



Thus, each u k is a normal mode, and the normal mode frequencies are 



in 

The density of states for this band of phonon excitations is 

7r I a 



sin(±fca)|. (5.100) 



f dk 

9(s) = / S(e - hu> k ) 

J la (5-101) 



— (j 2 -e 2 y 1/2 0(e) S(J-e) 



where J = 2hy / n/m is the phonon bandwidth. The step functions require < e < J; outside this range there are 
no phonon energy levels and the density of states accordingly vanishes. 

The entire theory can be quantized, taking [p n , u n ,\ = ~ih5 nn ,. We then define 

j= £ P k e* kna , Pk = ^=Y,Pn ^ . (5-102) 



in which case [p k , ii k ,] = —ihS kk ,. Note that u k = u_ k and p k = p_ k . We then define the ladder operator 
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and its Hermitean conjugate a\, in terms of which the Hamiltonian is 

H = J2^ k {a{a k + l), (5.104) 

k 

which is a sum over independent harmonic oscillator modes. Note that the sum over k is restricted to an interval 
of width 2tt, e.g. fee [— ~ ] , which is the first Brillouin zone for the one-dimensional chain structure. The state at 
wavevector k + ^ is identical to that at k, as we see from eqn. 5.98. 



5.6.2 General theory of lattice vibrations 

The most general model of a harmonic solid is described by a Hamiltonian of the form 

6 = E 4^ + \ E E E «?(«)<(« - #) «f (#) . (5-ios) 



where the dynamical matrix is 



2M, 2 

-R,i 1 1,3 a,/3 R .R' 



- i?') = a , (5.106) 

where U is the potential energy of interaction among all the atoms. Here we have simply expanded the potential 
to second order in the local displacements uf(R). The lattice sites R are elements of a Bravais lattice. The indices i 
and j specify basis elements with respect to this lattice, and the indices a and /3 range over {1, . . . , d}, the number 
of possible directions in space. The subject of crystallography is beyond the scope of these notes, but, very briefly, 
a Bravais lattice in d dimensions is specified by a set of d linearly independent primitive direct lattice vectors a u such 
that any point in the Bravais lattice may be written as a sum over the primitive vectors with integer coefficients: 
R = 5Z; =1 n l a v The set of all such vectors {R} is called the direct lattice. The direct lattice is closed under the 
operation of vector addition: if R and R! are points in a Bravais lattice, then so is R + R' . 

A crystal is a periodic arrangement of lattice sites. The fundamental repeating unit is called the unit cell. Not 
every crystal is a Bravais lattice, however. Indeed, Bravais lattices are special crystals in which there is only one 
atom per unit cell. Consider, for example, the structure in fig. 5.4. The blue dots form a square Bravais lattice 
with primitive direct lattice vectors a 1 = ax and a 2 = ay, where a is the lattice constant, which is the distance 
between any neighboring pair of blue dots. The red squares and green triangles, along with the blue dots, form 
a basis for the crystal structure which label each sublattice. Our crystal in fig. 5.4 is formally classified as a square 
Bravais lattice with a three element basis. To specify an arbitrary site in the crystal, we must specify both a direct 
lattice vector R as well as a basis index j e {1, . . . , r}, so that the location is R + rjj. The vectors {Vj} are the basis 
vectors for our crystal structure. We see that a general crystal structure consists of a repeating unit, known as a 
unit cell. The centers (or corners, if one prefers) of the unit cells form a Bravais lattice. Within a given unit cell, the 
individual sublattice sites are located at positions rjj with respect to the unit cell position R. 

Upon diagonalization, the Hamiltonian of eqn. 5.105 takes the form 

H = £ hw a {k) (4(fe) A a (k) + I) , (5.107) 

k,a 

where 

[A a (k),Al(k')]=S ab S kk ,. (5.108) 
The eigenfrequencies are solutions to the eigenvalue equation 

^*?f{k)e$(k)=M i u£{k)efc\k) , (5.109) 

3,0 
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Figure 5.4: A crystal structure with an underlying square Bravais lattice and a three element basis. 



where 



<I> 



^(fc) =£<(!?) e" 
R 



ik R 



(5.110) 



Here, k lies within the first Brillouin zone, which is the unit cell of the reciprocal lattice of points G satisfying 
e iG R _ j £ or a jj q anc j jY\e reciprocal lattice is also a Bravais lattice, with primitive reciprocal lattice vectors 
b ; , such that any point on the reciprocal lattice maybe written G = Yli=i m i fy- One also has that a l ■ b v = 2ir5 w . 
The index a ranges from 1 to d ■ r and labels the mode of oscillation at wavevector k. The vector e^(fc) is the 
polarization vector for the a th phonon branch. In solids of high symmetry, phonon modes can be classified as 
longitudinal or transverse excitations. 

For a crystalline lattice with an r-element basis, there are then d ■ r phonon modes for each wavevector k lying 
in the first Brillouin zone. If we impose periodic boundary conditions, then the k points within the first Brillouin 
zone are themselves quantized, as in the d = 1 case where we found k = 2im/N . There are TV distinct k points in 
the first Brillouin zone - one for every direct lattice site. The total number of modes is than d ■ r ■ N, which is the 
total number of translational degrees of freedom in our system: rN total atoms (N unit cells each with an r atom 
basis) each free to vibrate in d dimensions. Of the d ■ r branches of phonon excitations, d of them will be acoustic 
modes whose frequency vanishes as k — > 0. The remaining d(r — 1) branches are optical modes and oscillate at finite 
frequencies. Basically, in an acoustic mode, for k close to the (Brillouin) zone center k = 0, all the atoms in each 
unit cell move together in the same direction at any moment of time. In an optical mode, the different basis atoms 
move in different directions. 



There is no number conservation law for phonons - they may be freely created or destroyed in anharmonic pro- 
cesses, where two photons with wavevectors fc and q can combine into a single phonon with wavevector k + q, 
and vice versa. Therefore the chemical potential for phonons is p, = 0. We define the density of states g a (u) for the 
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a th phonon mode as 



(5.111) 



where is the number of unit cells, V is the unit cell volume of the direct lattice, and the k sum and integral are 
over the first Brillouin zone only. Note that u here has dimensions of frequency. The functions g a (w) is normalized 
to unity: 

oo 

(5.112) 



l dujg a (ui) = 1 

The total phonon density of states per unit cell is given by 2 

dr 

sM = £&*( 



(5.113) 



a=l 



The grand potential for the phonon gas is 

oo 

r2(T,y) = -fc B TlnJ]; £ e- /Mh -W(».W+*) 

fc,a n o (fc)=0 



fc B T£ln 

k.a 



2 sinh 



Nk B T I dui g(uj) In 



V 2fc B r 

2 sinh 



(5.114) 



Note that V = N V since there are TV unit cells, each of volume V ■ The entropy is given by S = — ( ) y and thus 
the heat capacity is 

\2 



2kZf 



csch^ 



C v = -T^=Nk B Jdug(u;) 

o 

Note that as T — > oo we have csch( ^[pf ) — * ~j£r~/ an d therefore 

oo 

lim C V (T) = Nk B fdw g{u) = rdNk 



hw 
2k~f 



(5.115) 



(5.116) 



This is the classical Dulong-Petit limit of ^fc B per quadratic degree of freedom; there are rN atoms moving in d 
dimensions, hence d ■ rN positions and an equal number of momenta, resulting in a high temperature limit of 

C v = rdNk B . 



5.6.3 Einstein and Debye models 



Historically two models of lattice vibrations have received wide attention. First is the so-called Einstein model, in 
which there is no dispersion to the individual phonon modes. We approximate g a (u>) ~ 6(u> — ui a ), in which case 

2 Note the dimensions of g(u>) are (frequency) -1 . By contrast, the dimensions of g(e) in eqn. 5.24 are (energy) -1 ■ (volume) -1 . The 
difference lies in the a factor of V ■ h, where V is the unit cell volume. 
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Figure 5.5: Upper panel: phonon spectrum in elemental rhodium (Rh) at T = 297 K measured by high precision 
inelastic neutron scattering (INS) by A. Eichler et ah, Phys. Rev. B 57, 324 (1998). Note the three acoustic branches 
and no optical branches, corresponding to d = 3 and r = 1. Lower panel: phonon spectrum in gallium arsenide 
(GaAs) at T = 12 K, comparing theoretical lattice-dynamical calculations with INS results of D. Strauch and B. 
Dorner, /. Phys.: Condens. Matter 2, 1457 (1990). Note the three acoustic branches and three optical branches, 
corresponding to d = 3 and r = 2. The Greek letters along the ir-axis indicate points of high symmetry in the 
Brillouin zone. 



At low temperatures, the contribution from each branch vanishes exponentially, since csch 2 ( ) — 4 e huJ <*/ k B T _ 
0. Real solids don't behave this way. 

A more realistic model, due to Debye, accounts for the low-lying acoustic phonon branches. Since the acoustic 
phonon dispersion vanishes linearly with |fe| as A; — > 0, there is no temperature at which the acoustic phonons 
'freeze out' exponentially, as in the case of Einstein phonons. Indeed, the Einstein model is appropriate in describ- 
ing the d (r— 1) optical phonon branches, though it fails miserably for the acoustic branches. 

In the vicinity of the zone center k = (also called r in crystallographic notation) the d acoustic modes obey a 
linear dispersion, with oj a (k) = c a (k) k. This results in an acoustic phonon density of states in d = 3 dimensions 
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of 



2ir 2 

3V n 



Ef dk 
/ TZ 



1 



4tt C 3(fc) 



6K - lo) 



(5.118) 



w 2 e(cj D -w) , 



where c is an average acoustic phonon velocity (i.e. speed of sound) defined by 



and uj d is a cutoff known as the Debye frequency. The cutoff is necessary because the phonon branch does not 
extend forever, but only to the boundaries of the Brillouin zone. Thus, u> should roughly be equal to the energy 
of a zone boundary phonon. Alternatively, we can define lu d by the normalization condition 

oo 

Jduj g(u>) = 3 => uj d = (6tt 2 /V ) 1/3 c . (5.120) 
o 

This allows us to write g(cj) = (9w 2 /w 3 ) 0(w D — ui). 
The specific heat due to the acoustic phonons is then 



o 

/ 2T\ 3 



2 ( fa* 1 



Oh T 

- (5m) 

3 



where D = fku D /k B is the Debye temperature and 



±a; 3 a; 

2j 



0(x) = /d^ 4 csch 2 < = { (5.122) 

30 X 00 ■ 



o 



Therefore, 

3 



3iVfc B T > <~> D 



Thus, the heat capacity due to acoustic phonons obeys the Dulong-Petit rule in that C V (T — > oo) = 3Nk B , cor- 
responding to the three acoustic degrees of freedom per unit cell. The remaining contribution of 3(r — l)Nk B to 
the high temperature heat capacity comes from the optical modes not considered in the Debye model. The low 
temperature T 3 behavior of the heat capacity of crystalline solids is a generic feature, and its detailed description 
is a triumph of the Debye model. 
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Element 


Ag 


Al 


Au 


C 


Cd 


Cr 


Cu 


Fe 


Mn 


(K) 


227 


433 


162 


2250 


210 


606 


347 


477 


409 


Tmclt (K) 


962 


660 


1064 


3500 


321 


1857 


1083 


1535 


1245 


Element 


Ni 


Pb 


Pt 


Si 


Sn 


Ta 


Ti 


W 


Zn 


O d (K) 


477 


105 


237 


645 


199 


246 


420 


383 


329 


T me it (K) 


1453 


327 


1772 


1410 


232 


2996 


1660 


3410 


420 



Table 5.1: Debye temperatures (at T = 0) and melting points for some common elements (carbon is assumed to be 
diamond and not graphite). (Source: the internet!) 



5.6.4 Melting and the Lindemann criterion 

Atomic fluctuations in a crystal 

For the one-dimensional chain, eqn. 5.103 gives 

Therefore the RMS fluctuations at each site are given by 

(««) = -^Yl^k^-k 

k 



di. — a 



-k) 



(5.124) 



(5.125) 



N * — ' mw t 

k 



where n(k, T) = [cxp(hu> k /k- B T) ~ l] 1 is the Bose occupancy function. 

Let us now generalize this expression to the case of a rf-dimensional solid. The appropriate expression for the RMS 
position fluctuations of the i basis atom in each unit cell is 



K(fe) 



(5.126) 



Here we sum over all wavevectors fc in the first Brilliouin zone, and over all normal modes a. There are dr normal 
modes per unit cell i.e. d branches of the phonon dispersion u> a (k). (For the one-dimensional chain with d = 1 and 
r = 1 there was only one such branch to consider). Note also the quantity M ia (k), which has units of mass and is 
defined in terms of the polarization vectors ej„ (fe) as 



1 



M la {k) 



(5.127) 



The dimensions of the polarization vector are [mass] , since the generalized orthonormality condition on the 
normal modes is 

(5.128) 



where M i is the mass of the atom of species i within the unit cell (i € {1, . . . , r}). For our purposes we can replace 
M ia (k) by an appropriately averaged quantity which we call M i ; this 'effective mass' is then independent of the 
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mode index a as well as the wavevector k. We may then write 

(5.129) 

o J 

where we have dropped the site label R since translational invariance guarantees that the fluctuations are the 
same from one unit cell to the next. Note that the fluctuations ( uf ) can be divided into a temperature-dependent 
part ( uf ) th and a temperature-independent quantum contribution ( w? ) , where 

oo 


oo 

.2\ _ h L.9(") 



>«>qu = ^/^— ■ (5-131) 



Let's evaluate these contributions within the Debye model, where we replace g{uj) by 



We then find 



/H = - w) . (5.132) 



>'>-« = 3^1 -5b£=-- (5134) 



where 

„d-2 




We can now extract from these expressions several important conclusions: 



1) The T = contribution to the the fluctuations, ( uf ) , diverges in d = 1 dimensions. Therefore there are no 
one-dimensional quantum solids. 

2) The thermal contribution to the fluctuations, ( uf ) th , diverges for any T > whenever d < 2. This is because 
the integrand of F d (x) goes as s d ~ 3 as s — > 0. Therefore, there are no two-dimensional classical solids. 

3) Both the above conclusions are valid in the thermodynamic limit. Finite size imposes a cutoff on the fre- 
quency integrals, because there is a smallest wavevector fc min ~ 2ir/L, where L is the (finite) linear dimen- 
sion of the system. This leads to a low frequency cutoff cj min = 2ttc/ L, where c is the appropriately averaged 
acoustic phonon velocity from eqn. 5.119, which mitigates any divergences. 



Lindemann melting criterion 



An old phenomenological theory of melting due to Lindemann says that a crystalline solid melts when the RMS 
fluctuations in the atomic positions exceeds a certain fraction r) of the lattice constant a. We therefore define the 
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ratios 



^ s ^ = * ■ I m^t J • ir ■ ^ d/t) (5136) 



(«?)qu d 2 ( K 2 \ 1 



^,qu - fl2 ~ 2 (d - 1) V M * a2 fc B / ' ' (5 ' 137) 




with x t = ^Jx 2 th + x 2 qu = y/(u'f) I a. 

Let's now work through an example of a three-dimensional solid. We'll assume a single element basis (r = 1). We 
have that 

9fi 2 /4fce ^ 1Q9K (513g) 

1 amu A 

According to table 5.1, the melting temperature always exceeds the Deb ye temperature, and often by a great 
amount. We therefore assume T » 6> D , which puts us in the small x limit of F d (x). We then find 

where 

0* = 109 K 2 . (5.140) 

M[amu] ■ (a [A]) 

The total position fluctuation is of course the sum x 2 = x 2 th + x 2 qu . Consider for example the case of copper, with 
M = 56 amu and a = 2.87 A. The Debye temperature is <9 D = 347 K. From this we find x qu = 0.026, which says 
that at T = the RMS fluctuations of the atomic positions are not quite three percent of the lattice spacing (i.e. 
the distance between neighboring copper atoms). At room temperature, T = 293 K, one finds x th = 0.048, which 
is about twice as large as the quantum contribution. How big are the atomic position fluctuations at the melting 
point? According to our table, T molt = 1083 K for copper, and from our formulae we obtain x molt = 0.096. The 
Lindemann criterion says that solids melt when x(T) w 0.1. 

We were very lucky to hit the magic number x mclt = 0.1 with copper. Let's try another example. Lead has 
M = 208 amu and a = 4.95 A. The Debye temperature is D = 105 K ('soft phonons'), and the melting point is 
T mdt = 327 K - From these data we obtain x(T = 0) = 0.014, x(293K) = 0.050 and x{T = 327 K) = 0.053. Same 
ballpark. 

We can turn the analysis around and predict a melting temperature based on the Lindemann criterion x(T mclt ) = rj, 
where rj ~ 0.1. We obtain 

We call T L the Lindemann temperature. Most treatments of the Lindemann criterion ignore the quantum correction, 
which gives the — 1 contribution inside the above parentheses. But if we are more careful and include it, we see 
that it may be possible to have T L < 0. This occurs for any crystal where 6> D < (9* /rj 2 . 

Consider for example the case of 4 He, which at atmospheric pressure condenses into a liquid at T c = 4.2 K and 
remains in the liquid state down to absolute zero. At p = 1 atm, it never solidifies! Why? The number density of 
liquid 4 He at p = 1 atm and T = OK is 2.2 x 10 22 cm 3 . Let's say the Helium atoms want to form a crystalline 
lattice. We don't know a priori what the lattice structure will be, so let's for the sake of simplicity assume a simple 
cubic lattice. From the number density we obtain a lattice spacing of a = 3.57 A. OK now what do we take for 
the Debye temperature? Theoretically this should depend on the microscopic force constants which enter the 
small oscillations problem (i.e. the spring constants between pairs of helium atoms in equilibrium). We'll use the 



206 CHAPTER 5. NONINTERACTING QUANTUM SYSTEMS 



expression we derived for the Debye frequency, cj d = (67r 2 /V ) 1/,3 c, where V is the unit cell volume. We'll take 
c = 238 m/s, which is the speed of sound in liquid helium at T = 0. This gives O d = 19.8 K. We find 0* = 2.13 K, 
and if we take rj = 0.1 this gives 0* /if = 213 K, which significantly exceeds <9 D . Thus, the solid should melt 
because the RMS fluctuations in the atomic positions at absolute zero are huge: x qu = (0*/0 D )V 2 = 0.33. By 
applying pressure, one can get 4 He to crystallize above p c = 25 atm (at absolute zero). Under pressure, the unit 
cell volume V decreases and the phonon velocity c increases, so the Debye temperature itself increases. 

It is important to recognize that the Lindemann criterion does not provide us with a theory of melting per se. 
Rather it provides us with a heuristic which allows us to predict roughly when a solid should melt. 



5.6.5 Goldstone bosons 



The vanishing of the acoustic phonon dispersion at fc = is a consequence of Goldstone's theorem which says that 
associated with every broken generator of a continuous symmetry there is an associated bosonic gapless excitation 
(i.e. one whose frequency u> vanishes in the long wavelength limit). In the case of phonons, the 'broken generators' 
are the symmetries under spatial translation in the x, y, and z directions. The crystal selects a particular location 
for its center-of-mass, which breaks this symmetry. There are, accordingly, three gapless acoustic phonons. 

Magnetic materials support another branch of elementary excitations known as spin waves, or magnons. In 
isotropic magnets, there is a global symmetry associated with rotations in internal spin space, described by the 
group SU(2). If the system spontaneously magnetizes, meaning there is long-ranged ferromagnetic order (||f 
• • ■ ), or long-ranged antiferromagnetic order (T4T4 ' ' ' )/ then global spin rotation symmetry is broken. Typically 
a particular direction is chosen for the magnetic moment (or staggered moment, in the case of an antiferromag- 
net). Symmetry under rotations about this axis is then preserved, but rotations which do not preserve the selected 
axis are 'broken'. In the most straightforward case, that of the antiferromagnet, there are two such rotations for 
SU(2), and concomitantly two gapless magnon branches, with linearly vanishing dispersions w a (fc). The situation 
is more subtle in the case of ferromagnets, because the total magnetization is conserved by the dynamics (unlike 
the total staggered magnetization in the case of antiferromagnets). Another wrinkle arises if there are long-ranged 
interactions present. 

For our purposes, we can safely ignore the deep physical reasons underlying the gaplessness of Goldstone bosons 
and simply posit a gapless dispersion relation of the form uj(k) = A \ k\ a . The density of states for this excitation 
branch is then 

g(oj)=cJ- 1 e(cj c -u;), (5.142) 



where C is a constant and uj c is the cutoff, which is the bandwidth for this excitation branch. 3 Normalizing the 
density of states for this branch results in the identification uj c = (d/aCy/ d . 

The heat capacity is then found to be 



CV = Nk„ C Idujuj' 1 ( ) csch 2 



(5.143) 

d {2T\ d/a , , , 



3 If u{k) = Ak a , thenC = 2 1 ~ d n 2 a' 1 A " g / T(d/2) . 
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where = hw c /k B and 

= fdtt° +1 csch 2 t = i (5.144) 
o ( 2 " d / CT r(2 + |)C(2 + |) x^cx), 

which is a generalization of our earlier results. Once again, we recover Dulong-Petit for k B T 3> fiw c , with C V (T ^> 

r^ c /fc B ) = iVfc B . 

In an isotropic ferromagnet, z.e.a ferromagnetic material where there is full SU(2) symmetry in internal 'spin' 
space, the magnons have a k 2 dispersion. Thus, a bulk three-dimensional isotropic ferromagnet will exhibit a heat 
capacity due to spin waves which behaves as T 3 / 2 at low temperatures. For sufficiently low temperatures this will 
overwhelm the phonon contribution, which behaves as T 3 . 



5.7 The Ideal Bose Gas 

5.7.1 General formulation for noninteracting systems 

Recall that the grand partition function for noninteracting bosons is given by 

(oo \ , 

£ ef>0>-'Jn a j = IX (l - e^~^) , (5.145) 
n a — / a 

In order for the sum to converge to the RHS above, we must have fj, < e a for all single-particle states |a). The 
density of particles is then 



where g(e) = y ~ £ a) 1S m e density of single particle states per unit volume. We assume that g(e) = for 

e < e ; typically e = 0, as is the case for any dispersion of the form e(k) = A\k\ r , for example. However, in the 
presence of a magnetic field, we could have e(k, a) = A\k\ r — g^i Q Ha, in which case e = —gfj, \H\. 

Clearly n(T, /i) is an increasing function of both T and [i. At fixed T, the maximum possible value for n(T, /it), 
called the critical density n c (T), is achieved for \i = e , i.e. 

n c (T) = Ids ^ 



0(e-e o ) - 1 



The above integral converges provided g{e Q ) = 0, assuming g(e) is continuous. If g{e Q ) ^ 0, the integral diverges, 
and n c (T) = oo. In this latter case, one can always invert the equation for n(T, //) to obtain the chemical potential 
n(T, n). In the former case, where the n c (T) is finite, we have a problem - what happens if n > n c {T) ? 

In the former case, where n c (T) is finite, we can equivalently restate the problem in terms of a critical temperature 
T c (n), defined by the equation n c (T c ) = n. For T < T c , we apparently can no longer invert to obtain fj,(T, n), so 
clearly something has gone wrong. The remedy is to recognize that the single particle energy levels are discrete, 
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and separate out the contribution from the lowest energy state e . I.e. we write 



^ ") 4 ^nri + h i*^rrr - (5147) 

where g is the degeneracy of the single particle state with energy e Q . We assume that n is finite, which means 
that iV = Vn is extensive. We say that the particles have condensed into the state with energy e . The quantity n Q 
is the condensate density. The remaining particles, with density n', are said to comprise the overcondensate. With the 
total density n fixed, we have n = n Q + n' . Note that n finite means that fj, is infinitesimally close to e : 

Note also that if e — /i is finite, then n oc V^ 1 is infinitesimal. 
Thus, for T < T c (n), we have fi = e with n > 0, and 

oo 

n(T,n ) = n + Jde e(g _J^ r _ 1 • (5.149) 



For T > T c (n), we have n = and 



The equation for T c (n) is 



fe e(£ -i ( ; B T _ i • ( 5 - i5 °) 



oo 



3 (e-e )/ fc B T c - 1 



5.7.2 Ballistic dispersion 

We already derived, in §5.3.3, expressions for n(T, z) and p(T, z) for the ideal Bose gas (IBG) with ballistic disper- 
sion e(p) = p 2 /2m, We found 

n(T,z) = g\ T d Li # (z) (5.151) 

p(T,z)=gk B T\ T d Lu +1 0), (5.152) 

where g is the internal (e.g. spin) degeneracy of each single particle energy level. Here z = e M//fe B T is the fugacity 
and 

00 m 

Li *W = E h (5-153) 

m— 1 

is the polylogarithm function. For bosons with a spectrum bounded below by e = 0, the fugacity takes values on 
the interval z G [0, l] 4 . 



4 It is easy to see that the chemical potential for noninteracting bosons can never exceed the minimum value e of the single particle 
dispersion. 
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Figure 5.6: The polylogarithm function Li s (z) versus z for s = h,s= I, and s — | . Note that Li s ( 1 ) = £(s) diverges 
for s < 1. 

Clearly n(T, z) = g\~ d Lu(z) is an increasing function of z for fixed T. In fig. 5.6 we plot the function Li s (z) 

versus z for three different values of s. We note that the maximum value Li s (z = 1) is finite if s > 1. Thus, for 
d > 2, there is a maximum density n laax (T) = gLid (z) which is an increasing function of temperature T. Put 
another way, if we fix the density n, then there is a critical temperature T c below which there is no solution to the 
equation n = n(T, z). The critical temperature T c (n) is then determined by the relation 



:C(f) 



mk B T c 
2nh 2 



d/2 



2i:h 2 



;C(f) 



2/d 



(5.154) 



What happens for T < T c l 



As shown above in §5.7, we must separate out the contribution from the lowest energy single particle mode, which 
for ballistic dispersion lies at e = 0. Thus writing 



V z- 



1 V V 



-l e £ «/ fc B T - 1 



(5.155) 



where we have taken g = 1. Now V^ 1 is of course very small, since V is thermodynamically large, but if \i — > 
then z _1 — 1 is also very small and their ratio can be finite, as we have seen. Indeed, if the density of k = bosons 
n is finite, then their total number ./V satisfies 



N = Vn 



1 



l + jV " 



The chemical potential is then 



H = k B T In z = -k B T In (l + N^ 1 ) 



(5.156) 



(5.157) 



In other words, the chemical potential is infinitesimally negative, because N is assumed to be thermodynamically 
large. 



According to eqn. 5.14, the contribution to the pressure from the k = states is 

Po 



, . n ^ , . I 

ln(l-^) = ^ln(l + iV o )^0 H 



(5.158) 
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So the k = bosons, which we identify as the condensate, contribute nothing to the pressure. 

Having separated out the k = mode, we can now replace the remaining sum over a by the usual integral over 
k. We then have 

T<T C : n = n +gC(l)XT d (5.159) 

p = gC{i + l)k B T\^ d (5.160) 

and 

T>T C : n = gLi^(z) X T d (5.161) 

P = g^ i+l {z)K T XT d ■ (5-162) 

The condensate fraction n Q /n is unity at T = 0, when all particles are in the condensate with k = 0, and decreases 
with increasing T until T = T c , at which point it vanishes identically. Explicitly, we have 



»o(T) _ , f C(|) _ , f T 



- 1 -'-^- 1 -\^af ■ (5163) 



Let us compute the internal energy E for the ideal Bose gas. We have 



and therefore 



±Wfi) = fi + f}^=n-T¥L = fi + TS (5.164) 



E = Q + TS + tiN = nN + 4z (0ft) 

op 

= v(nn + -£ 3 (pp)) (5.165) 



d(3 



This expression is valid at all temperatures, both above and below T c . Note that the condensate particles do not 
contribute to E, because the k = condensate particles carry no energy. 

We now investigate the heat capacity C v N — (§y)y N - Since we have been working in the GCE, it is very 
important to note that N is held constant when computing C v N . We'll also restrict our attention to the case d = 3 
since the ideal Bose gas does not condense at finite T for d < 2 and d > 3 is unphysical. While we're at it, we'll 
also set g = 1. 



The number of particles is 



'N + ((I)V\t 3 (T<T c ) 
N = { (5.166) 

VX T 3 Li 3/2 (z) (T>T C ) , 



and the energy is 



E = I k B T ^U 5/2 (z) . (5.167) 
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Figure 5.7: Molar heat capacity of the ideal Bose gas. Note the cusp at T = T c . 



For T < T , we have z = 1 and 



The molar heat capacity is therefore 



a 



V,N 



9TJv,n 



¥ C(t) K tf ■ 



<Vat( T :™) = 



c 



N 



For T > T c , we have 



\ V — -4 -*B^ —5/21^; ^F'^TIV ^3/ 

where we have invoked eqn. 5.49. Taking the differential of N, we have 



V dT 



dN \ V = I Li 3/2( Z ) ■ Y" + Li l/ 2 (^) ^3" • Y " 



We set <i/V = 0, which fixes dz in terms of dT, resulting in 



c v,n(T, z) — 5 R 



Li 5/2( Z ) l Li 3/2( Z ) 



Li 3/2 (z) Li 1/2 (z) 



(5.168) 



(5.169) 



(5.170) 



(5.171) 



(5.172) 



To obtain c v N (T, n), we must invert the relation 

n(T,z) = Ay 3 Li 3/2 (z) 



(5.173) 



in order to obtain z(T, n), and then insert this into eqn. 5.172. The results are shown in fig. 5.7. There are several 
noteworthy features of this plot. First of all, by dimensional analysis the function c v N (T, n) is R times a function 
of the dimensionless ratio T/T c (n) oc T?i~ 2 / 3 . Second, the high temperature limit is which is the classical 
value. Finally, there is a cusp at T = T c (n). 
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Figure 5.8: Phase diagrams for the ideal Bose gas. Left panel: (p, v) plane. The solid blue curves are isotherms, 
and the green hatched region denotes v < v c (T), where the system is partially condensed. Right panel: (p, T) 
plane. The solid red curve is the coexistence curve p c (T), along which Bose condensation occurs. No distinct 
thermodynamic phase exists in the yellow hatched region above p = p c (T). 



5.7.3 Isotherms for the ideal Bose gas 

Let a be some length scale and define 



Then we have 

V 

p_ 

Pa 



Pa 



2irh 2 



2nh 2 
ma 2 k„ 



3/2 



5/2 



Li 3/2 0) + v a n 
Li 5/2( Z ) ' 



(5.174) 

(5.175) 
(5.176) 



where v = V/N is the volume per particle 5 and n is the condensate number density; v vanishes for T > T c , when 
2 = 1. Note that the pressure is independent of volume for T < T c . The isotherms in the (p, v) plane are then flat 
for v < v c . This resembles the coexistence region familiar from our study of the thermodynamics of the liquid-gas 
transition. 



Recall the Gibbs-Duhem equation, 

rf/i = — s dT + v dp . 
Along a coexistence curve, we have the Clausius-Clapeyron relation, 

So ~ Si £ 



dp\ 
dTj c 



TAv 



(5.177) 



(5.178) 



where £ = T (s 2 — s{) is the latent heat per mole, and Av = v 2 — l>i- For ideal gas Bose condensation, the 
coexistence curve resembles the red curve in the right hand panel of fig. 5.8. There is no meaning to the shaded 



5 Note that in the thermodynamics chapter we used v to denote the molar volume, N\ V/N. 
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Figure 5.9: Phase diagram of 4 He. All phase boundaries are first order transition lines, with the exception of the 
normal liquid-superfluid transition, which is second order. (Source: University of Helsinki) 



region where p > p c {T). Nevertheless, it is tempting to associate the curve p 
k = condensate and the remaining uncondensed (k ^ 0) bosons 6 . 



p c (T) with the coexistence of the 



The entropy in the coexistence region is given by 



1 

'N 



dTJv 



C(f) Kv\ 



-3 



c(i: 



c(i; 



i 



(5.179) 



All the entropy is thus carried by the uncondensed bosons, and the condensate carries zero entropy. The Clausius- 
Clapeyron relation can then be interpreted as describing a phase equilibrium between the condensate, for which 
s o — v a — 0' an d the uncondensed bosons, for which s' = s(T) and v' = v c (T). So this identification forces us to 
conclude that the specific volume of the condensate is zero. This is certainly false in an interacting Bose gas! 

While one can identify, by analogy, a 'latent heat' £ = T As = Ts in the Clapeyron equation, it is important to 
understand that there is no distinct thermodynamic phase associated with the region p > p c (T). Ideal Bose gas 
condensation is a second order transition, and not a first order transition. 



5.7.4 The A-transition in Liquid 4 He 

Helium has two stable isotopes. 4 He is a boson, consisting of two protons, two neutrons, and two electrons (hence 
an even number of fermions). 3 He is a fermion, with one less neutron than 4 He. Each 4 He atom can be regarded 
as a tiny hard sphere of mass to = 6.65 x 10~ 24 g and diameter a — 2.65 A. A sketch of the phase diagram is shown 
in fig. 5.9. At atmospheric pressure, Helium liquefies at TJ = 4.2 K. The gas-liquid transition is first order, as usual. 
However, as one continues to cool, a second transition sets in at T = T x = 2.17 K (atp = 1 atm). The A-transition, 
so named for the A-shaped anomaly in the specific heat in the vicinity of the transition, as shown in fig. 5.10, is 
continuous (i.e. second order). 

If we pretend that 4 He is a noninteracting Bose gas, then from the density of the liquid n = 2.2 x 10 22 cm -3 , we 

obtain a Bose-Einstein condensation temperature T c = ^^-(^/C(|)) 2 ^ 3 — 3.16 K, which is in the right ballpark. 
The specific heat C p (T) is found to be singular at T = T x , with 

C p (T)=A\T-T x (p)\- a . (5.180) 
6 The k ^ particles are sometimes called the overcondensate. 
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Figure 5.10: Specific heat of liquid 4 He in the vicinity of the A-transition. Data from M. J. Buckingham and W. M. 
Fairbank, in Progress in Low Temperature Physics, C. J. Gortner, ed. (North-Holland, 1961). Inset at upper right: 
more recent data of J. A. Lipa et at, Phys. Rev. B 68, 174518 (2003) performed in zero gravity earth orbit, to within 
AT = 2 nK of the transition. 

a is an example of a critical exponent. We shall study the physics of critical phenomena later on in this course. 
For now, note that a cusp singularity of the type found in fig. 5.7 corresponds to a = — 1. The behavior of 
C p (T) in 4 He is very nearly logarithmic in |T — TJ. In fact, both theory (renormalization group on the 0(2) 
model) and experiment concur that a is almost zero but in fact slightly negative, with a = —0.0127 ± 0.0003 in 
the best experiments (Lipa et ah, 2003). The A transition is most definitely not an ideal Bose gas condensation. 
Theoretically, in the parlance of critical phenomena, IBG condensation and the A-transition in 4 He lie in different 
universality classes 7 . Unlike the IBG, the condensed phase in 4 He is a distinct thermodynamic phase, known as a 
superfluid. 

Note that C p (T < T c ) for the IBG is not even defined, since for T < T c we have p = p{T) and therefore dp = 
requires dT = 0. 



5.7.5 Fountain effect in superfluid 4 He 

At temperatures T < T x , liquid 4 He has a superfluid component which is a type of Bose condensate. In fact, there 
is an important difference between condensate fraction N,_ Q /N and superfluid density, which is denoted by the 
symbol p s . In 4 He, for example, at T = the condensate fraction is only about 8%, while the superfluid fraction 



7 IBG condensation is in the universality class of the spherical model. The A-transition is in the universality class of the XY model. 
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Figure 5.11: The fountain effect. In each case, a temperature gradient is maintained across a porous plug through 
which only superfluid can flow. This results in a pressure gradient which can result in a fountain or an elevated 
column in a U-tube. 



pjp = 1. The distinction between 7V and p s is very interesting but lies beyond the scope of this course. 

One aspect of the superfluid state is its complete absence of viscosity. For this reason, superfluids can flow through 
tiny cracks called microleaks that will not pass normal fluid. Consider then a porous plug which permits the 
passage of superfluid but not of normal fluid. The key feature of the superfluid component is that it has zero 
energy density. Therefore even though there is a transfer of particles across the plug, there is no energy exchange, 
and therefore a temperature gradient across the plug can be maintained 8 . 

The elementary excitations in the superfluid state are sound waves called phonons. They are compressional waves, 
just like longitudinal phonons in a solid, but here in a liquid. Their dispersion is acoustic, given by uj(k) = ck 
where c = 238 m/s. 9 The have no internal degrees of freedom, hence g = 1. Like phonons in a solid, the phonons 
in liquid helium are not conserved. Hence their chemical potential vanishes and these excitations are described 
by photon statistics. We can now compute the height difference Ah in a U-tube experiment. 

Clearly Ah = Ap/pg. so we must find p(T) for the helium. In the grand canonical ensemble, we have 



(2^)3 ln ( X - 

OO 



,-hck/k B T\ 



(5.181) 



ir 2 (KT) 4 
90 {he) 3 



8 Recall that two bodies in thermal equilibrium will have identical temperatures if they are free to exchange energy. 
9 The phonon velocity c is slightly temperature dependent. 
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Let's assume T = 1 K. We'll need the density of liquid helium, p = 148 kg/m 3 . 

dh _ 2ir 2 fk B T\ 3 k B 
~dT ~ "45" \hc~) ~pg 

_ 2f_ / (1,38 x 1Q- 23 J/K)(1K) \ 3 (1.38 x 10~ 23 J/K) (5.182) 
45 v (1.055 x 10~ 34 J • s)(238m/s) J * (148 kg/m 3 ) (9.8 m/s 2 ) 

~ 32cm/K , 

a very noticeable effect! 



5.7.6 Bose condensation in optical traps 

The 2001 Nobel Prize in Physics was awarded to Weiman, Cornell, and Ketterle for the experimental observation 
of Bose condensation in dilute atomic gases. The experimental techniques required to trap and cool such systems 
are a true tour deforce, and we shall not enter into a discussion of the details here 10 . 

The optical trapping of neutral bosonic atoms, such as 87 Rb, results in a confining potential V(r) which is quadratic 
in the atomic positions. Thus, the single particle Hamiltonian for a given atom is written 

h 2 

H = V 2 + \m(u 2 x 2 +uW + <A z 2 ) , (5.183) 

2m z ' 

where ui 1 2 3 are the angular frequencies of the trap. This is an anisotropic three-dimensional harmonic oscillator, 
the solution of which is separable into a product of one-dimensional harmonic oscillator wavefunctions. The 
eigenspectrum is then given by a sum of one-dimensional spectra, viz. 

E ni ,n 2 ,n 3 = K + 3) fe«>i + (n 2 + i) huj 2 + (n 3 + (5.184) 

According to eqn. 5.16, the number of particles in the system is 

00 00 00 _ j 

_/V = ^ ^ Jy- 1 e »l fa l/ l B T e «2 Su 2/ t B T e «! SlJ 3/' : B T _ ] 



n n — n — 

1 \ / 1 \ / 1 



fc=i 



(5.185) 



where we've defined 



Note that t/ e [0,1]. 



1 ~—khui 1 /k B T I I 1 g—khuj 2 /k- B T I I 1 ^ — khuj 3 /k B T 

y = eV-l k v T e - hul i/' 2k B T e - h ^2/ 2k B T e -h>*>3/ 2k B T (5 



Let's assume that the trap is approximately anisotropic, which entails that the frequency ratios uj 1 /oj 2 etc. are all 
numbers on the order of one. Let us further assume that k B T hui 1 2 3 . Then 



1 



1 _ e ~khu 3 /k B T 

where k*{T) = k B T/hCu > 1, with 



'M- k<k*(T) 

(5.187) 

.1 k>k*{T) 



1 /3 

Hi = (uj 1 lu 2 W 3 ) . (5.188) 



10 Many reliable descriptions may be found on the web. Check Wikipedia, for example. 
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We then have 

where the first term on the RHS is due to k > k* and the second term from k < k* in the previous sum. Since 
k* 1 and since the sum of inverse cubes is convergent, we may safely extend the limit on the above sum to 
infinity. To help make more sense of the first term, write A^ = (y -1 — l) 1 for the number of particles in the 

(%, n 2! n 3 ) = (0,0,0) state. Then 

V = , N ° ■ (5-190) 

J N + l v ' 

This is true always. The issue vis-a-vis Bose-Einstein condensation is whether N Q ^> 1. At any rate, we now see 
that we can write 

N « N (1 + AT- 1 )-'* + Li 3 (y) . (5.191) 

As for the first term, we have 

f A^ < k* 

N (l + No l )~ k ' = I (5.192) 
[N AT »fc* 

Thus, as in the case of IBG condensation of ballistic particles, we identify the critical temperature by the condition 

y = N /(N +l)wl, and we have 

where v = uj/2tt. We see that k B T c ^ to if the number of particles in the trap is large: AT 3> 1. In this regime, we 
have 

T<T C : N = N + C(3) f^j?) (5.194) 

fc = T x3 



T>T C : N =[^-)U 3 (y). (5.195) 

It is interesting to note that BEC can also occur in two-dimensional traps, which is to say traps which are very 
anisotropic, with oblate equipotential surfaces V(r) = V . This happens when Hlu 3 ^> k B T 3> u> 1 2 . We then have 

1 /2 

with H) = (uj 1 uj 2 ) ■ The particle number then obeys a set of equations like those in eqns. 5.194 and 5.195, mutatis 
mutandis 11 . 

For extremely prolate traps, with uj 3 <C w 1 2 , the situation is different because Li 1 (y) diverges for y = 1. We then 
have 

t rri 

N = N Q + —2 — In (l + N Q ) . (5.197) 

to 3 

Here we have simply replaced y by the equivalent expression N / (N + 1). If our criterion for condensation is that 
A^ = aN, where a is some fractional value, then we have 



"Explicitly, one replaces C(3) with C(2) = Li 3 (y) with Li 2 (y), and (k B T/hw) z with (fc B T/to) 2 
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5.7.7 Example problem from Fall 2004 UCSD graduate written exam 

PROBLEM : A three-dimensional gas of noninteracting bosonic particles obeys the dispersion relation e(k) = A \k 



1/2 



(a) Obtain an expression for the density n(T, z) where z = exp(/j, / k B T) is the fugacity. Simplify your expres- 
sion as best you can, adimensionalizing any integral or infinite sum which may appear. You may find it 
convenient to define 



1 f y- 1 ^2 fe 



Note Li„(l) = C,(v), the Riemann zeta function. 



o 



fc=i 



(b) Find the critical temperature for Bose condensation, T c (n). Your expression should only include the density 
n, the constant A, physical constants, and numerical factors (which may be expressed in terms of integrals 
or infinite sums). 

(c) What is the condensate density n when T = \T C 1 

(d) Do you expect the second virial coefficient to be positive or negative? Explain your reasoning. (You don't 
have to do any calculation.) 

SOLUTION : We work in the grand canonical ensemble, using Bose-Einstein statistics. 

(a) The density for Bose-Einstein particles are given by 

, d 3 k 1 

n(T,z) 



(2tt) 3 z- 1 exp(Afc 1 /2/fc B T) - 1 

□ 

* 5 

dS z^—l ( 5 - 200 > 



1 fKT^ 



o 

where we have changed integration variables from k to s = Ak 1 / 2 /k B T, and we have defined the functions 
Li„(z) as above, in eqn. 5.199. Note Li„(l) = (,{v), the Riemann zeta function. 

(b) Bose condensation sets in for z = 1, i.e. fi = 0. Thus, the critical temperature T c and the density n are related 
by 

_ 120 C(6) ( k B T c x6 



or 



.4 



.4 



2^ \!/6 



(5.201) 



7r n 



TJn) = — . (5.202) 



(c) For T < T c , we have 



n = nr,+ 



120 C(6) A B T x6 



rji \6 



.4 



(5.203) 



n 
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where n is the condensate density. Thus, at T = i T c , 

n (T=|T c ) = in. (5.204) 
(d) The virial expansion of the equation of state is 

p = nk B T(l + B 2 (T) n + B 3 (T) n 2 + . . . ) . (5.205) 



We expect B2(T) < for noninteracting bosons, reflecting the tendency of the bosons to condense. (Corre- 
spondingly for noninteracting fermions we expect .82 (T) > 0.) 

For the curious, we compute B2{T) by eliminating the fugacity z from the equations for n(T, z) and p(T, z). 
First, we find p(T, z): 

d 3 k 



p(T, z) = -k B T J**hi(l- Z cM-Ak^/k u T)) 



k B Tfk B T\ e /rfss 5 ln(l _ ze - s) (52Q6) 



.4 



120fc B T k B T 



7T 2 V A 

Expanding in powers of the fugacity, we have 





6 

Li 7 (z). 



k B T n 2 V A 
Solving for z{n) using the first equation, we obtain, to order n 2 , 



120 fk B T\ 6 r z 2 z 3 i , c „„ m 

^{- J A -){ z+ v + ¥ + ---\ (5 ' 207) 
{ z + ¥ + ¥ + ---}- (5 ' 208) 



120 /LT 



/ -K 2 A 6 n \ If tt 2 A 6 



n 



2 



z = {wy&F ) - ¥ [noo^f ) + ° {n ) ' (5 - 209) 

Plugging this into the equation for p(T, z), we obtain the first nontrivial term in the virial expansion, with 

.2 / A X 6 



W) = ~ 15360 UtJ' (5 - 210) 
which is negative, as expected. Note also that the ideal gas law is recovered for T — > 00, for fixed n. 

5.8 The Ideal Fermi Gas 

5.8.1 Grand potential and particle number 

The grand potential of the ideal Fermi gas is, per eqn. 5.14, 

fl(T,V,n) = -Vk B T J^ln (l + e* lk * T e - £ =/ fc B T ) 

(5.211) 

(^-s)/k B T\ 



-Vk B T J deg(e) In (l + e 
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The average number of particles in a state with energy e is 



hence the total number of particles is 



N = Vjde 9 (e) e(e _ ii)/kBT + i . (5.213) 

— oo 



5.8.2 The Fermi distribution 

We define the function 

= m - ( 5 - 214 ) 

known as the Fenwz distribution. In the T — > oo limit, /(e) — > | for all finite values of e. As T — > 0, /(e) approaches 
a step function 0(— e). The average number of particles in a state of energy e in a system at temperature T and 
chemical potential fi is n(e) = f(e — fi). In fig. 5.12 we plot f(e — fi) versus e for three representative temperatures. 



5.8.3 T = and the Fermi surface 

At T = 0, we therefore have n(e) = 9(/i — e), which says that all single particle energy states up to e = \x are 
filled, and all energy states above e = [i are empty. We call /i(T = 0) the Fermi energy: e F = /i(T = 0). If the 
single particle dispersion e(k) depends only on the wavevector k, then the locus of points in fc-space for which 
e(k) = e F is called the Fermi surface. For isotropic systems, e{k) = e(k) is a function only of the magnitude k = |fc|, 
and the Fermi surface is a sphere in d — 3 or a circle in d = 2. The radius of this circle is the Fermi wavevector, k F . 
When there is internal (e.g. spin) degree of freedom, there is a Fermi surface and Fermi wavevector (for isotropic 
systems) for each polarization state of the internal degree of freedom. 
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Let's compute the Fermi wavevector k F and Fermi energy e F for the IFG with a ballistic dispersion e(fc) 
h 2 k 2 /2m. The number density is 



n = g d d kO(k F -k) 



;0 d k d F 



(27r) d d 



gk F /n (d=l) 
gk F /4ir (d = 2) 
gfc F /6^ 2 (d = 3) 



(5.215) 



Note that the form of n(k F ) is independent of the dispersion relation, so long as it remains isotropic. Inverting the 
above expressions, we obtain k F (n): 



k F = 2tt 



l/d 



im/g (d=l) 
(4 7 rn/g) 1 /2 (d = 2) 



(5.216) 



(67r 2 n/g) 1/3 (d = 3) . 
The Fermi energy in each case, for ballistic dispersion, is therefore 



h 2 k F _ 2n 2 h 2 

2m m 



dn 
g^ 



2/d 



s 2 fc 2 2 

tt n n 



2-Kh 2 



(d=l) 

(d = 2) 



(5.217) 



l^r(^) 2/3 (d = 3). 



Another useful result for the ballistic dispersion, which follows from the above, is that the density of states at the 
Fermi level is given by 

ffM = " ^ = I' - ■ (5-218) 



(2tt) c 



fi 2 



2 e c 



For the electron gas, we have g = 2. In a metal, one typically has fc F ~0.5A 1 to2A , and e F ~ 1 eV — 10 eV. 
Due to the effects of the crystalline lattice, electrons in a solid behave as if they had an effective mass m* which 
is typically on the order of the electron mass but very often about an order of magnitude smaller, particularly in 
semiconductors . 

Nonisotropic dispersions e(k) are more interesting in that they give rise to non-spherical Fermi surfaces. The 
simplest example is that of a two-dimensional 'tight-binding' model of electrons hopping on a square lattice, as 
may be appropriate in certain layered materials. The dispersion relation is then 



e(k x ,k y ) = —2t cos(k x a) — 2t cos(k y a) 



(5.219) 



where k x and k y are confined to the interval [ — ^, ^] ■ The quantity t has dimensions of energy and is known as 
the hopping integral. The Fermi surface is the set of points (k x , k y ) which satisfies e{k x ,k y ) = e F . When e F achieves 
its minimum value of e F un = —At, the Fermi surface collapses to a point at (k x , k y ) = (0, 0). For energies just above 
this minimum value, we can expand the dispersion in a power series, writing 



e(k x , k y ) 



-4±- 



ta 2 (kl + kD-rLta* {k x + k 4 y ) + 



(5.220) 



If we only work to quadratic order in k x and k y , the dispersion is isotropic, and the Fermi surface is a circle, with 
(e F + At) i 'ta 2 . As the energy increases further, the continuous 0(2) rotational invariance is broken down to 



k F 
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Figure 5.13: Fermi surfaces for two and three-dimensional structures. Upper left: free particles in two dimensions. 
Upper right: 'tight binding' electrons on a square lattice. Lower left: Fermi surface for cesium, which is predom- 
inantly composed of electrons in the 6s orbital shell. Lower right: the Fermi surface of yttrium has two parts. 
One part (yellow) is predominantly due to 5s electrons, while the other (pink) is due to Ad electrons. (Source: 
www.phys. ufl.edu/ fermisurface/) 

the discrete group of rotations of the square, C 4v . The Fermi surfaces distort and eventually, at e F = 0, the Fermi 
surface is itself a square. As e F increases further, the square turns back into a circle, but centered about the point 
(o' o)" Note that everything is periodic in k x and k y modulo ^L. The Fermi surfaces for this model are depicted 
in the upper right panel of fig. 5.13. 

Fermi surfaces in three dimensions can be very interesting indeed, and of great importance in understanding the 
electronic properties of solids. Two examples are shown in the bottom panels of fig. 5.13. The electronic configu- 
ration of cesium (Cs) is [Xc] 6s 1 . The 6s electrons 'hop' from site to site on a body centered cubic (BCC) lattice, a 
generalization of the simple two-dimensional square lattice hopping model discussed above. The elementary unit 
cell in k space, known as the first Brillouin zone, turns out to be a dodecahedron. In yttrium, the electronic structure 
is [Kr] 5s 2 Ad 1 , and there are two electronic energy bands at the Fermi level, meaning two Fermi surfaces. Yttrium 
forms a hexagonal close packed (HCP) crystal structure, and its first Brillouin zone is shaped like a hexagonal 
pillbox. 



5.8.4 Spin-split Fermi surfaces 



Consider an electron gas in an external magnetic field H. The single particle Hamiltonian is then 

- P 2 

H = \-/j, B Ha , 

2m 



(5.221) 
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where ^ B is the Bohr magneton, 

A« B = tt^- = 5-788 x 1(T 9 cV/G 

2mc ' (5.222) 

Ai B /fc B = 6.717 x 1CT 5 K/G , 

where m is the electron mass. What happens at T = to a noninteracting electron gas in a magnetic field? 

Electrons of each spin polarization form their own Fermi surfaces. That is, there is an up spin Fermi surface, 
with Fermi wavevector fc F j, and a down spin Fermi surface, with Fermi wavevector fc F , . The individual Fermi 
energies, on the other hand, must be equal, hence 

which says 



_ 2eH 
kp l - /c FT — 

The total density is 



fe F | /cp^ — ~fi ' (5.224) 



"=7# + ^ => k^ + k^ = 6^n. (5.225) 

Clearly the down spin Fermi surface grows and the up spin Fermi surface shrinks with increasing H . Eventually, 
the minority spin Fermi surface vanishes altogether. This happens for the up spins when fe F | = 0. Solving for the 
critical field, we obtain 

ff c = g.( 67 r 2 n) 1/3 . (5.226) 

In real magnetic solids, like cobalt and nickel, the spin-split Fermi surfaces are not spheres, just like the case of the 
(spin degenerate) Fermi surfaces for Cs and Y shown in fig. 5.13. 

5.8.5 The Sommerfeld expansion 

In dealing with the ideal Fermi gas, we will repeatedly encounter integrals of the form 

oo 

l(T,f,) = [def{e-fx)cj>(e). (5.227) 



The Sommerfeld expansion provides a systematic way of expanding these expressions in powers of T and is an 
important analytical tool in analyzing the low temperature properties of the ideal Fermi gas (IFG). 

We start by defining 

e 

<P(e) = {de'4>{e') (5.228) 



so that cf)(e) = <P'(e). We then have 

oo 

I =jde f(e - fi) — 



(5.229) 

dsf{e)${ii + e), 
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where we assume <P(— oo) = 0. Next, we invoke Taylor's theorem, to write 



$(a + e)=y — - — 



exp ( e - J *M 



(5.230) 



This last expression involving the exponential of a differential operator may appear overly formal but it proves 
extremely useful. Since 

1 e t/k B T 



we can write 



I = dv 



k B T ( e e/k B T + j\ 



(5.231) 



(5.232) 



(e v + l)(e-' u + 1) 

— oo 

with v — e/k B T, where 

D = kr4- (5.233) 
is a dimensionless differential operator. The integral can now be done using the methods of complex integration: 12 



oo 

f e vD ^ 

/ dv 7 7T7 7T = 27TI > ReS 

J (e« + l)(e-» + 1) ^ 



(e u + l)(e-« + 1) 



i?— (2n+l)i7i 



-27rz^^ e ( 2 " +1 ) 

n=0 

2?riD e"- 



1-e 



2niD 



ttD cscnD 



(5.234) 



Thus, 

T(T, /i) = ttD csc(7rL>) #(//) , (5.235) 

which is to be understood as the differential operator ttD(cscttD) = irD/ sin(7rZ?) acting on the function <P(fi). 
Appealing once more to Taylor's theorem, we have 



Thus, 



I{T,n)= def( S -[i)<f>(e) 



(5.236) 



iecf>(e) + y (fc B T) 2 0'( M ) + ^ (fc B T)V» + 



(5.237) 



If </>(e) is a polynomial function of its argument, then each derivative effectively reduces the order of the polyno- 
mial by one degree, and the dimensionless parameter of the expansion is (T/ /z) 2 . This procedure is known as the 

Sommerfeld expansion. 



2 Note that writing v = (2n + 1) in + e we have 



-l=Fe 



+ ... , so (e* + l)(e-« + 1) 



We then expand 



e (2n+i)iTrD (i + e D + . . .) to find the residue: Res = -De 



(2n + l)iirD 
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Figure 5.14: Deformation of the complex integration contour in eqn. 5.234. 
5.8.6 Chemical potential shift 

As our first application of the Sommerfeld expansion formalism, let us compute fi(n, T) for the ideal Fermi gas. 
The number density n(T, ^) is 



n = deg(e) f(e - (j,) 



Jjdeg{e) + ^{k li T) 2 g'^) + 



(5.238) 



Let us write /i = e F + <5/i, where e F = ji(T = 0, n) is the Fermi energy, which is the chemical potential at T = 0. 
We then have 



n= j de g(e) + — (fc B T) 2 g'{e F + 6/i) + 



=jd£g{e)+g(e F )5n+ — (k B T) 2 g' (e F ) + 



(5.239) 



from which we derive 



Sfi=-^-(k B T) 2 ^4+0(T 4 ) 



6 V " B_/ g{e F ) 

Note that g' / g = (In g)' . For a ballistic dispersion, assuming g = 2, 



9(e) =2 J- 



d 3 k J h 2 k 2 \ mkie) 
o \ e — 



(27T) 



2?7l 7 TT 2 H 2 



Thus, 17(e) oc e 1 / 2 and (In g)' = |e 1 , so 



M(n,T) = e F --^— + 



(5.240) 



(5.241) 



(5.242) 
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where e F (n) = ^(37r 2 n) 2 / 3 . 

5.8.7 Specific heat 

The energy of the electron gas is 

oo 

— = ldeg{e)sf{e- n) 



-oo 



de g(e) e + y (/c B T) 2 — gfaf) + ... 

-oo 

/ 7T 2 7T 2 

/ de g(e) e + g(e F ) e F S^+ — (k B T) 2 e F g\e F ) + — (fc B T) 2 g(e F ) 



where 



where 



Note that the molar heat capacity is 



g(e F ) gmfc F 6n 3 



F 



(5.243) 



e = Jde g(s) e (5.244) 

— oo 

is the ground state energy density (i.e. ground state energy per unit volume). Thus, 

'dE\ 



C v,n=[^) = ^ VklTg{e F ) = VjT , (5.245) 



'V,N ° 

1 = (5^246) 



N 3 n 2 \ e F J 

where in the last expression on the RHS we have assumed a ballistic dispersion, for which 



(5.248) 



The molar heat capacity in eqn. 5.247 is to be compared with the classical ideal gas value of Relative to 
the classical ideal gas, the IFG value is reduced by a fraction of (7r 2 /3) x (k B T/e F ), which in most metals is very 
small and even at room temperature is only on the order of 10~ 2 . Most of the heat capacity of metals at room 
temperature is due to the energy stored in lattice vibrations. 

5.8.8 Magnetic susceptibility and Pauli paramagnetism 

Magnetism has two origins: (i) orbital currents of charged particles, and (ii) intrinsic magnetic moment. The 
intrinsic magnetic moment m of a particle is related to its quantum mechanical spin via 

m = gu^S/h , u n = — = magneton , (5.249) 

2mc 



5.8. THE IDEAL FERMI GAS 



227 



where g is the particle's g-factor, fi its magnetic moment, and S is the vector of quantum mechanical spin oper- 
ators satisfying [S a , S 13 ] = ihe at3j S 1 , i.e. SU(2) commutation relations. The Hamiltonian for a single particle is 
then 



H=J-( p -1a) 2 Hm 



2m* V c 



1 / e \ 2 o 

\p+-A) +j-[i B Ha, 



(5.250) 



2m* V c / 2 



where in the last line we've restricted our attention to the electron, for which q = — e. The g-factor for an electron 
is g = 2 at tree level, and when radiative corrections are accounted for using quantum electrodynamics (QED) one 
finds g = 2.0023193043617(15). For our purposes we can take g = 2, although we can always absorb the small 
difference into the definition of /Lt B , writing /x B — > p, B = geh/Amc. We've chosen the £-axis in spin space to point 
in the direction of the magnetic field, and we wrote the eigenvalues of S z as \ fia, where a = ±1. The quantity 
m* is the effective mass of the electron, which we mentioned earlier. An important distinction is that it is m* which 
enters into the kinetic energy term p 2 /2m*, but it is the electron mass m itself (m = 511 keV) which enters into the 
definition of the Bohr magneton. We shall discuss the consequences of this further below. 

In the absence of orbital magnetic coupling, the single particle dispersion is 

e a (k) = ^ + ^Ha. (5.251) 
At T = 0, we have the results of §5.8.4. At finite T , we once again use the Sommerfeld expansion. We then have 

oo oo 

n= Jdeg r (e)f(e-fi) + I ds g^e) f(e - fi) 

— oo — OO 

oo 

= \ Jde{g(e-v B H)+g{e + fi B H)}f{e-fi) (5.252) 

le{g(e) + {fL B HYg"{e) + ...)f{e-LL). 
We now invoke the Sommerfeld expension to find the temperature dependence: 



n = jde g(e) + — (k B T) 2 g'^) + ( f \H) 2 </( M ) + 



7T 2 

de g(e) + g(s F ) 8f*+ — (k B T) 2 g'(e F ) + {fi B H) 2 g'(e F ) + 



(5.253) 



Note that the density of states for spin species a is 

9(T (e) = lg(e-il B Ha), (5.254) 

where g(e) is the total density of states per unit volume, for both spin species, in the absence of a magnetic field. 
We conclude that the chemical potential shift in an external field is 

^2 , it. v\A -9'( £ f) 



6»(T, n,H)=-{'- {k B T) + {^Hf \ + . . . . (5.255) 

9\ £ f) 
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Figure 5.15: Fermi distributions in the presence of an external Zeeman-coupled magnetic field. 
We next compute the difference — in the densities of up and down spin electrons: 

lds{g ] (e)-g l (e)}f(s-f,) 

k {.9(e - K H ) - .9(e + K H )} f( £ ~ M) 

= -pL s H ■ ttD csc(ttL>) g(ji) + 0(H 3 ) . 

We needn't go beyond the trivial lowest order term in the Sommerfeld expansion, because H is already assumed 
to be small. Thus, the magnetization density is 



(5.256) 



in which the magnetic susceptibility is 



M = -£ B (n T - n \ ) = Mb 9{ £ f) h 



(dM\ _ 2 , . 



(5.257) 



(5.258) 



This is called the Pauli paramagnetic susceptibility. 



5.8.9 Landau diamagnetism 

When orbital effects are included, the single particle energy levels are given by 

h 2 k 2 

e(n, k z ,a) = (n+ \)frw c + + fi B H a . 

Here n is a Landau level index, and ui c = eH/m*c is the cyclotron frequency. Note that 

p, B H gehH m*c g m* 



hu>„ 



Amc HeH 4 m 



(5.259) 



(5.260) 
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Accordingly, we define the ratio r = (g/2) x (to* /to). We can then write 

h 2 kl 



e(n,k z ,a) = (n + ^ + ^ r<T ) ^ c 



2m* 



The grand potential is then given by 

oo 

Q = . Lz . f^L V lTl[l + e ^/ k B T ^(n+i + ^ra^J^T e -h 2 k 2 z /2mk B T 



(5.261) 



(5.262) 



A few words are in order here regarding the prefactor. In the presence of a uniform magnetic field, the energy 
levels of a two-dimensional ballistic charged particle collapse into Landau levels. The number of states per Landau 
level scales with the area of the system, and is equal to the number of flux quanta through the system: = 
HA/(f> , where <fi = he /e is the Dirac flux quantum. Note that 



— — • L 7 ■ k„T = hiu r ■ 
00 A| 



hence we can write 



where 



n=0 a=±l 



V fdk 



In 



1 + e -^/k B T e -h 2 kl/2m*k B T 



We now invoke the Euler-MacLaurin formula, 

oo 

oo 

E f W= dxF(x) + ±F(0)-±F'(0) + ... , 

n=0 { 

resulting in 

{oo 
Jde Q(e - /a) + \ hu c Q(\(l + ra)frw c - fi) 

i(l+r CT )fiw c 

- ^ (^ c ) 2 Q'(|(l + ro)hu c -/*)+•• 
We next expand in powers of the magnetic field H to obtain 

oo 

Q{T, V,n,H) = 2f> de Q(e - n) + (\r 2 - ±) (hu c ) 2 Q'(-/x) + 



Thus, the magnetic susceptibility is 



1 d 2 Q 
VdH 2 



(r 2 -I).^-K-*) 2 -(-|0'(-M)) 



~1 2 



(5.263) 



(5.264) 



(5.265) 



(5.266) 



(5.267) 



(5.268) 



(5.269) 
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where k t is the isothermal compressibility 13 . In most metals we have m* sw m and the term in brackets is positive 
(recall g w 2). In semiconductors, however, we can have m* -c m; for example in GaAs we have m* = 0.067. 
Thus, semiconductors can have a diamagnetic response. If we take g = 2 and m* = m, we see that the orbital 
currents give rise to a diamagnetic contribution to the magnetic susceptibility which is exactly — | times as large 
as the contribution arising from Zeeman coupling. The net result is then paramagnetic (X > 0) and | as large as 
the Pauli susceptibility. The orbital currents can be understood within the context of Lenz's law. 

Exercise : Show that —y Q'(—fj,) = n 2 n T . 



5.8.10 White dwarf stars 

There is a nice discussion of this material in R. K. Pathria, Statistical Mechanics. As a model, consider a mass 
M ~ 10 33 g of helium at nuclear densities of p ~ 10 7 g/cm 3 and temperature T ~ 10 7 K. This temperature is 
much larger than the ionization energy of 4 He, hence we may safely assume that all helium atoms are ionized. If 
there are N electrons, then the number of a particles (i.e. 4 He nuclei) must be \N. The mass of the a particle is 
m a w 4m p . The total stellar mass M is almost completely due to a particle cores. 



The electron density is then 

N 2 ■ M/im u p 

' p - 1 ~ 10 30 cm" 3 , (5.270) 



V V 2m p 

since M = N ■ m e + ■ 4m p . From the number density n we find for the electrons 

k F = (3tt 2 77.) 1/3 = 2.14 x 10 10 cm" 1 (5.271) 

p F = hk F = 2.26 x 10" 17 gcm/s (5.272) 

mc= (9.1 x 10~ 28 g)(3 x 10 10 m/s) = 2.7 x 10" 17 gcm/s . (5.273) 

Since p F ~ mc, we conclude that the electrons are relativistic. The Fermi temperature will then be T F ~ mc 2 ~ 
10 6 eV ~ 10 12 K. Thus, T <C T { which says that the electron gas is degenerate and may be considered to be at 
T ~ 0. So we need to understand the ground state properties of the relativistic electron gas. 



The kinetic energy is given by 
The velocity is 



e(p) = vVc 2 + m 2 c 4 - mc 2 . (5.274) 
de pc 2 



dp ^p 2 c 2 + m 2 c 4 



(5.275) 



The pressure in the ground state is 

Po = \n(p ■ v) 



Pf 

1 / 2 p 2 C 2 



37T 2 /! 3 J dPP x / p 2 c 2 + m 2 c i 







(5.276) 



3tt 2 /i 3 

o 



96tt 2 /i 3 



d9 smh A 9 



(sinh(46» F ) - 8sinh(26> F ) + 12 6* 



i 3 We've U sed-AQ' (At ) = _^!_fi 
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Figure 5.16: Mass-radius relationship for white dwarf stars. (Source: Wikipedia). 



where we use the substitution 

p = mc sinh 9 , v = c tanh ( 

Note thatp F = hk F = /i^tt 2 ?!) 1 / 3 , and that 

M 



2m p V 



3tt n = 



= ±ln 



9?r M 



c + v 



c — v 



8 R 3 m r 



Now in equilibrium the pressure p is balanced by gravitational pressure. We have 

dE = -p dV = -p (R) ■ 4nR 2 dR . 

This must be balanced by gravity: 

GM 2 

dE s =i- —gr dR > 

where 7 depends on the radial mass distribution. Equilibrium then implies 

7 GM 2 



Po(R) 



4tt R 4 



To find the relation R = R(M), we must solve 

7 gM 2 m 4 c 5 



4tt R 4 96n 2 h 3 



(sinh(46» F ) - 8sinh(26» F ) + 12 F ) 



Note that 



I<4 *f-0 



sinh(46» F ) - 8sinh(26» F ) + 126» F = 



(5.277) 
(5.278) 

(5.279) 
(5.280) 

(5.281) 

(5.282) 
(5.283) 



\e 4S * e F 



00 
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Thus, we may write 



h 2 ( 9tt M \ 
15tr' 2 m \ 8 m p J 

he ( 97T 

127^ \_ 8 W 



5/3 



(5.284) 



4/3 



#p — > 00 



In the limit 9 F — > 0, we solve for R(M) and find 

« = 3^ 0-) 2/3 



h 2 



Gm^mM 1 / 3 



(5.285) 



In the opposite limit 9 F — > oo, the i? factors divide out and we obtain 



M = Mr, 



9 /3tt 



64 V7 



G 



(5.286) 



To find the R dependence, we must go beyond the lowest order expansion of eqn. 5.283, in which case we find 



9tt 



— 



1/3 



mc / \ m 



M 



1/3 



Mn 



1/2 



(5.287) 



The value M is the limiting size for a white dwarf. It is called the Chandrasekhar limit. 



Chapter 6 

Classical Interacting Systems 
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6.2 Ising Model 
6.2.1 Definition 

The simplest model of an interacting system consists of a lattice C of sites, each of which contains a spin a i which 
may be either up (a i = +1) or down (er- = —1). The Hamiltonian is 

<«) « 

When J > 0, the preferred (i.e. lowest energy) configuration of neighboring spins is that they are aligned, i.e. 
Oi<Jj = +1. The interaction is then called ferromagnetic. When J < the preference is for anti-alignment, i.e. 
a i Gj = — 1, which is antiferromagnetic. 

This model is not exactly solvable in general. In one dimension, the solution is quite straightforward. In two 
dimensions, Onsager's solution of the model (with H = 0) is among the most celebrated results in statistical 
physics. In higher dimensions the system has been studied by numerical simulations (the Monte Carlo method) 
and by field theoretic calculations (renormalization group), but no exact solutions exist. 



6.2.2 Ising model in one dimension 

Consider a one-dimensional ring of N sites. The ordinary canonical partition function is then 

Z . = Tr e~ pA 

^ring 1 1 c 

JV 

= II e /3Jff " CT "+ 1 e^° Ha ™ (6.2) 
= Tr(R N ), 

where c N+1 = o x owing to periodic (ring) boundary conditions, and where R is a 2 x 2 transfer matrix, 

R / _ e tUaa' e /3 Mo ff(<7+<7')/2 



e /3J e Pn H e -0J 



(6.3) 



where r™ are the Pauli matrices. Since the trace of a matrix is invariant under a similarity transformation, we have 

Z(T, H, N) = X^ + X N , (6.4) 

where 



\±(T,H) = e 13 ' 1 cosh(f3^ H) ± ^ e 2 ? J sinh 2 ((3 fx H) + e- 2 /" (6.5) 

are the eigenvalues of R. In the thermodynamic limit, N — > oo, and the term dominates exponentially We 
therefore have 

F(T, H, N) = -Nk B T In A + (T, H) . (6.6) 
From the free energy, we can compute the magnetization, 

M = - (—) = Nf *° sinh (/^pg) (6 7) 



dH 



T,N 
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and the zero field isothermal susceptibility, 

Note that in the noninteracting limit J — ► we recover the familiar result for a free spin. The effect of the interac- 
tions at low temperature is to vastly increase the susceptibility. Rather than a set of independent single spins, the 
system effectively behaves as if it were composed of large blocks of spins, where the block size £ is the correlation 
length, to be derived below. 

The physical properties of the system are often elucidated by evaluation of various correlation functions. In this 
case, we define 



Tr ( (ji R„ „ • ■ • R„ „ o"~> _i_ i R~ ~ ■ ■ ■ R~ ~ 
C(n) = (a, a n+l ) = -fr (R N ) 

Tr (S R n £ R N ~ n ) 



Tr (R N ) 

where < n < N, and where 



1 

-1 



(6.9) 



(6.10) 



To compute this ratio, we decompose R in terms of its eigenvectors, writing 

i2 = A + |+X+| + A_ |-X-| . (6.11) 

Then 



A? S\ + + A? El_ + (A?-* XI + XI A?-") E + _E_ + 
C ( n > = A jv + A iv > ( 6 - 12 ) 

S^, =( f i\S\ f i') . (6.13) 



where 



6.2.3 H = 

Consider the case H = 0, where i? = e^ J + e~^ J t x , where t x is the Pauli matrix. Then 

|±) = -1=(|T>±|4>), (6.14) 

i.e. the eigenvectors of R are 

fa - 75 (±,) ■ «>- i5 > 

and ^ ++ = = 0, while U± = £ i = 1. The corresponding eigenvalues are 

A + = 2cosh(/3J) , A_ = 2sinh(/3J) . (6.16) 
The correlation function is then found to be 

A JV-|n| A |n| + x \n\ ^V-|n| 

C{n) = (a 1 a n+1 ) = -± ~ - A + 



tanh^'^+tanh^- 1 " 1 ^^ (6.17) 
1 +tanh A '(/3J) 

tanh H (/3J) (A^^oo). 
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This result is also valid for n < 0, provided |n| < N. We see that we may write 

C(n) = e"l"l/«( T ) , (6.18) 

where the correlation length is 

^ = lnctnh(W) • (6 - 19) 
Note that £(T) grows as T -> as £ w § e 2J / fc B T . 

6.2.4 Chain with free ends 

When the chain has free ends, there are (iV— 1) links, and the partition function is 

^chain = E ("^ jcrcr' 
cr,a' 

= E { A r : MS) + ^ i>- W iM*0} - 



(6.20) 



where V ; ±(°") = ( cr | ± ). When H = 0, we make use of eqn. 6.15 to obtain 

fliV " 1 = 5(i l)(2cosh/3j) JV - 1 + i(_ 1 1 - I 1 )(2sinh/3J) JV - 1 ) (6.21) 



and therefore 

" J chain 



^chain = 2 A 'cosh JV - 1 (/3J). (6.22) 



There's a nifty trick to obtaining the partition function for the Ising chain which amounts to a change of variables. 
We define 

v n = G n <Vfi (n = l, ... , N -1) . (6.23) 
Thus, v l = cr 1 (7 2 , ^2 = cr 2 cr 3' e ^ c - Note that each takes the values ±1. The Hamiltonian for the chain is 

N-l N-l 

Achate = -J £ CT » = £ "» ' ( 6 - 24 ) 

n— 1 n— 1 

The state of the system is defined by the N Ising variables {o^ , v x , . . . , ^_x}- Note that doesn't appear in the 
Hamiltonian. Thus, the interacting model is recast as N — 1 noninteracting Ising spins, and the partition function 
is 



Z , . = Tr e-^cham 

chain 



= ££•••£ J Jv ^ Jv * 



. . . e P Jv N-l 



°~1 v l "N-l 



(6.25) 



. N-l 



= E E e ^ =2 Ar cosh w - 1 (/3J) 
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6.2.5 Ising model in two dimensions : Peierls' argument 



We have just seen how in one dimension, the Ising model never achieves long-ranged spin order. That is, the 
spin-spin correlation function decays asymptotically as an exponential function of the distance with a correlation 
length £(T) which is finite for all > 0. Only for T = does the correlation length diverge. At T = 0, there are two 
ground states, | TTTT ' ' ' T ) an d ||||| ■ ■ • I )• To choose between these ground states, we can specify a boundary 
condition at the ends of our one-dimensional chain, where we demand that the spins are up. Equivalently, we can 
apply a magnetic field H of order 1/N , which vanishes in the thermodynamic limit, but which at zero temperature 
will select the 'all up' ground state. At finite temperature, there is always a finite probability for any consecutive 
pair of sites (n, n+1) to be in a high energy state, i.e. either 1 f J. ) or | |T )• Such a configuration is called a domain 
wall, and in one-dimensional systems domain walls live on individual links. Relative to the configurations | |T ) 
and | H ), a domain wall costs energy 2 J. For a system with M = xN domain walls, the free energy is 



F = 2MJ — fc B Tln (j^ 

= N ■ \ 2Jx + k B T\x]nx + (1 - x)ln(l - x) 



(6.26) 



Minimizing the free energy with respect to x, one finds x = 1 / (e 2J / fc B T + l), so the equilibrium concentration of 
domain walls is finite, meaning there can be no long-ranged spin order. In one dimension, entropy wins and there 
is always a thermodynamically large number of domain walls in equilibrium. And since the correlation length for 
T > is finite, any boundary conditions imposed at spatial infinity will have no thermodynamic consequences 
since they will only be 'felt' over a finite range. 

As we shall discuss in the following chapter, this consideration is true for any system with sufficiently short- 
ranged interactions and a discrete global symmetry. Another example is the g-state Potts model, 

ff =-- / EV,- ft Ev- (6 - 27) 

Here, the spin variables a i take values in the set {1, 2, . . . , q} on each site. The equivalent of an external magnetic 
field in the Ising case is a field h which prefers a particular value of a (cr = 1 in the above Hamiltonian). See the 
appendix in §6.8 for a transfer matrix solution of the one-dimensional Potts model. 

What about higher dimensions? A nifty argument due to R. Peierls shows that there will be a finite temperature 
phase transition for the Ising model on the square lattice 1 . Consider the Ising model, in zero magnetic field, on 
a N x x N square lattice, with N x — > oo in the thermodynamic limit. Along the perimeter of the system we 
impose the boundary condition <j i = +1. Any configuration of the spins may then be represented uniquely in 
the following manner. Start with a configuration in which all spins are up. Next, draw a set of closed loops on 
the lattice. By definition, the loops cannot share any links along their boundaries, i.e. each link on the lattice is 
associated with at most one such loop. Now flip all the spins inside each loop from up to down. Identify each 
such loop configuration with a label r. The partition function is 

Z = Tr e"^ = ]T e- 2fiJL r , (6.28) 
r 

where L r is the total perimeter of the loop configuration r. The domain walls are now loops, rather than individ- 
ual links, but as in the one-dimensional case, each link of each domain wall contributes an energy +2 J relative to 
the ground state. 

Now we wish to compute the average magnetization of the central site (assume N x y are both odd, so there is a 
unique central site). This is given by the difference P + (0) — P_(0), where -P M (0) = (8 a p ) is the probability that 



^ere we modify slightly the discussion in chapter 5 of the book by L. Peliti. 
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Figure 6.1: Clusters and boundaries for the square lattice Ising model. Left panel: a configuration P where the 
central spin is up. Right panel: a configuration C 7 o T where the interior spins of a new loop 7 containing the 
central spin have been flipped. 

the central spin has spin polarization \i. If P,(0) > P_(0), then the magnetization per site m = P + (0) — P_(0) is 
finite in the thermodynamic limit, and the system is ordered. Clearly 



-20JL r 



(6.29) 



res. 



where the restriction on the sum indicates that only those configurations where the central spin is up (er = +1) 
are to be included, (see fig. 6.1a). Similarly, 



-2/3 JL, 



(6.30) 



res_ 



where only configurations in which a = — 1 are included in the sum. Here we have defined 



s± = {r I <r = ±} . 



(6.31) 



i.e. £, (Z7_) is the set of configurations P in which the central spin is always up (down). Consider now the 
construction in fig. 6.1b. Any loop configuration P G XL may be associated with a unique loop configuration 
P € S + by reversing all the spins within the loop of P which contains the origin. Note that the map from P to 
P is many-to-one. That is, we can write P = C 7 o P, where C 7 overturns the spins within the loop 7, with the 
conditions that (i) 7 contains the origin, and (ii) none of the links in the perimeter of 7 coincide with any of the 
links from the constituent loops of P. Let us denote this set of loops as T r : 



Then 



T r = {7 : e int(7) and 7 n P = j . 
P + (0)-P_(0) = ^ £ e- 2 ^(l- 53 « 



res. 



-2/3JL. 



(6.32) 



(6.33) 
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If we can prove that J2-yer e 2/3 JL ' r < 1, then we will have established that m > 0. Let us ask: how many loops 
7 are there in Y r with perimeter LI We cannot answer this question exactly, but we can derive a rigorous upper 
bound for this number, which, following Peliti, we call g(L). We claim that 

To establish this bound, consider any site on such a loop 7. Initially we have 4 possible directions to proceed 
to the next site, but thereafter there are only 3 possibilities for each subsequent step, since the loop cannot run 
into itself. This gives 4 ■ 3 i_1 possibilities. But we are clearly overcounting, since any point on the loop could 
have been chosen as the initial point, and moreover we could have started by proceeding either clockwise or 
counterclockwise. So we are justified in dividing this by 2L. We are still overcounting, because we have not 
accounted for the constraint that 7 is a closed loop, nor that 7 n r = 0. We won't bother trying to improve our 
estimate to account for these constraints. However, we are clearly undercounting due to the fact that a given loop 
can be translated in space so long as the origin remains within it. To account for this, we multiply by the area 
of a square of side length L/4, which is the maximum area that can be enclosed by a loop of perimeter L. We 
therefore arrive at eqn. 6.34. Finally, we note that the smallest possible value of L is L = 4, corresponding to a 
square enclosing the central site alone. Therefore 



E 



1 00 4 (O 2\ 

«- wl -<H?><»<- W >*-H^"'- (6 - 35) 



7£T r fe=2 

where x = 3 e~ 2/3J . Note that we have accounted for the fact that the perimeter L of each loop 7 must be an even 
integer. The sum is smaller than unity provided x < x = 0.869756 . . ., hence the system is ordered provided 

< = L61531 • (636) 

J ln(S/x ) 

The exact result is k B T c = 2J/sinh _1 (l) = 2.26918 . . . The Peierls argument has been generalized to higher 
dimensional lattices as well 2 . 

With a little more work we can derive a bound for the magnetization. We have shown that 

p_( ) = - e ~ 2f3JLr E e ~ 2pJLi < r ■ - E e ~ 2PJLr = rP +(°) • ( 6 - 37 ) 

Thus, 

l = j P + (0) + P_(0)<(l + r)P + (0) (6.38) 



and therefore 



where r(T) is given in eqn. 6.35. 



m = P + (0) - P_(0) > (1 - r) P+(0) > i-^ , (6.39) 

1 + r 



6.2.6 Two dimensions or one? 

We showed that the one-dimensional Ising model has no finite temperature phase transition, and is disordered 
at any finite temperature T, but in two dimensions on the square lattice there is a finite critical temperature T c 
below which there is long-ranged order. Consider now the construction depicted in fig. 6.2, where the sites of 



2 See. e.g. J. L. Lebowitz and A. E. Mazel, /. Stat. Phys. 90, 1051 (1998). 
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Figure 6.2: A two-dimensional square lattice mapped onto a one-dimensional chain. 

a two-dimensional square lattice are mapped onto those of a linear chain 3 . Clearly we can elicit a one-to-one 
mapping between the sites of a two-dimensional square lattice and those of a one-dimensional chain. That is, the 
two-dimensional square lattice Ising model may be written as a one-dimensional Ising model, i.e. 

lattice chain 

H = - J a i G i = ~ H J ««' °™ <V ■ ( 6 - 4 °) 

<ii) n,n' 

How can this be consistent with the results we have just proven? 

The fly in the ointment here is that the interaction along the chain J , is long-ranged. This is apparent from 
inspecting the site labels in fig. 6.2. Note that site n = 15 is linked to sites n' = 14 and n' = 16, but also to sites n' = 
—6 and n' = —28. With each turn of the concentric spirals in the figure, the range of the interaction increases. To 
complicate matters further, the interactions are no longer translationally invariant, i.e. J nn , ^ J(n — n'). But it is the 
long-ranged nature of the interactions on our contrived one-dimensional chain which spoils our previous energy- 
entropy argument, because now the domain walls themselves interact via a long-ranged potential. Consider for 
example the linear chain with J nn , = J \n — n'\~ a , where a > 0. Let us compute the energy of a domain wall 
configuration where a n = +1 if n > and er n = — 1 if n < 0. The domain wall energy is then 

A = yy , 2J , . (6.4i) 

m =0n=l 1 1 

Here we have written one of the sums in terms of m = —n'. For asymptotically large m and n, we can write 
R = (m, n) and we obtain an integral over the upper right quadrant of the plane: 



oo «/ 

JdRR Ji 



/2 tt/4 oo 

# t 2J - = 2-/» \J*- . (6.42) 

R a (COS0 + sin0) Q J cos a (j) J R 01 ' 1 

-tt/4 1 



The 4> integral is convergent, but the R integral diverges for a < 2. For a finite system, the upper bound on the 
R integral becomes the system size L. For a > 2 the domain wall energy is finite in the thermodynamic limit 



3 A corresponding mapping can be found between a cubic lattice and the linear chain as well. 



6.2. ISING MODEL 



241 



L — > oo. In this case, entropy again wins. I.e. the entropy associated with a single domain wall is k B InL, and 
therefore F = E — k B T is always lowered by having a finite density of domain walls. For a < 2, the energy of a 
single domain wall scales as L 2 ~ a . It was first proven by F. J. Dyson in 1969 that this model has a finite temperature 
phase transition provided 1 < a < 2. There is no transition for a < 1 or a > 2. The case a = 2 is special, and is 
discussed as a special case in the beautiful renormalization group analysis by J. M. Kosterlitz in Phys. Rev. Lett 
37, 1577 (1976). 



6.2.7 High temperature expansion 

Consider once again the ferromagnetic Ising model in zero field (H = 0), but on an arbitrary lattice. The partition 
function is 

Z = Tr e^ E «) c ' = (cosh^j)^ Tr J JJ (l + astTf tr,) \ , (6.43) 

where x = tanh [3 J and iV L is the number of links. For regular lattices, N L = \zN , where A" is the number of lattice 
sites and z is the lattice coordination number, i.e. the number of nearest neighbors for each site. We have used 



e 0J<T<r' 



I e +l3J if aa' = +1 

cosh (3 J ■ \ l + aa' tanh (3 J } = <_„, , (6.44) 

I J e p if a a = —1 . 



We expand eqn. 6.43 in powers of x, resulting in a sum of 2 N ^ terms, each of which can be represented graphically 
in terms of so-called lattice animals. A lattice animal is a distinct (including reflections and rotations) arrangement 
of adjacent plaquettes on a lattice. In order that the trace not vanish, only such configurations and their compo- 
sitions are permitted. This is because each a i for every given site i must occur an even number of times in order 
for a given term in the sum not to vanish. For all such terms, the trace is 2 N . Let r represent a collection of lattice 
animals, and g r the multiplicity of r. Then 

Z = 2 N (cosh /3 J) Wl 9r ( tanh l 3J ) Lr > ( 6 - 45 ) 
r 

where L r is the total number of sites in the diagram F , and g r is the multiplicity of r. Since x vanishes as T — > oo, 
this procedure is known as the high temperature expansion (HTE). 

For the square lattice, he enumeration of all lattice animals with up to order eight is given in fig. 6.3. For the 
diagram represented as a single elementary plaquette, there are possible locations for the lower left vertex. For 
the 2x1 plaquette animal, one has g = 2N, because there are two inequivalent orientations as well as N trans- 
lations. For two disjoint elementary squares, one has g = ^N(N — 5), which arises from subtracting 5A^ 'illegal' 
configurations involving double lines (remember each link in the partition sum appears only once!), shown in the 
figure, and finally dividing by two because the individual squares are identical. Note that A^A^ — 5) is always 
even for any integer value of N. Thus, to lowest interesting order on the square lattice, 

Z = 2 Ar (cosh/?j) 2Ar {l + Nx A + 2Nx 6 + (7 - |)A^ 8 + ±N 2 x s + C(a; 10 )} . (6.46) 

The free energy is therefore 



F = -fc B Tln2 + ATfc B Tln(l - x 2 ) - Nk B T x 4 + 2 x b + | x s + <D(x 



NKT\n2- Nk 



: B T{x 2 +-lx 4 + lx 6 + fx s + 0(x w )}, 



(6.47) 



again withx = tanh/3J. Note that we've substituted cosh 2 /? J = 1/(1 — x 2 ) to write the final result as a power series 
in x. Notice that the 0(N 2 ) factor in Z has cancelled upon taking the logarithm, so the free energy is properly 
extensive. 
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Figure 6.3: HTE diagrams on the square lattice and their multiplicities. 



Note that the high temperature expansion for the one-dimensional Ising chain yields 

^chainm^) = 2 N cosh Ar - 1 /3J , Z ling (T,N) = 2 N cosh N (3J , (6.48) 

in agreement with the transfer matrix calculations. In higher dimensions, where there is a finite temperature phase 
transition, one typically computes the specific heat c(T) and tries to extract its singular behavior in the vicinity of 
T c , where c(T) ~ A (T - T c )- a . Since x(T) = tanh(J/fc B T) is analytic in T, we have c(x) ~ A' (x - x c )- a , where 
x c = x(T c ). One assumes x c is the singularity closest to the origin and corresponds to the radius of convergence 
of the high temperature expansion. If we write 



oo 



c (*) = 5>„z"~A"(l 

n=0 



(6.49) 



then according to the binomial theorem we should expect 



1 - 



1 — a 



(6.50) 



Thus, by plotting a n /a n _ 1 versus 1/n, one extracts l/x c as the intercept, and (a — l)/x c as the slope. 
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Figure 6.4: HTE diagrams for the numerator Ym of the correlation function Cm ■ The blue path connecting sites k 
and I is the string. The remaining red paths are all closed loops. 

High temperature expansion for correlation functions 

Can we also derive a high temperature expansion for the spin-spin correlation function C kl = (a k cr ; ) ? Yes we 
can. We have 



Tr 






Tr 







(6.51) 



Recall our analysis of the partition function Z. We concluded that in order for the trace not to vanish, the spin 
variable a i on each site i must occur an even number of times in the expansion of the product. Similar considera- 
tions hold for Y kl , except now due to the presence of a k and <r ; , those variables now must occur an odd number of 
times when expanding the product. It is clear that the only nonvanishing diagrams will be those in which there 
is a finite string connecting sites k and I, in addition to the usual closed HTE loops. See fig. 6.4 for an instructive 
sketch. One then expands both Y kl as well as Z in powers of x = tanh (3J , taking the ratio to obtain the correlator 
C kl . At high temperatures (x — > 0), both numerator and denominator are dominated by the configurations r with 
the shortest possible total perimeter. For Z, this means the trivial path r = {0}, while for Y kl this means finding 
the shortest length path from k to I. (If there is no straight line path from k to I, there will in general be several such 
minimizing paths.) Note, however, that the presence of the string between sites k and I complicates the analysis 
of g r for the closed loops, since none of the links of r can intersect the string. It is worth stressing that this does 
not mean that the string and the closed loops cannot intersect at isolated sites, but only that they share no common 
links; see once again fig. 6.4. 



6.3 Nonideal Classical Gases 

Let's switch gears now and return to the study of continuous classical systems described by a Hamiltonian 
H ({Xj}, {p^) ■ In the next chapter, we will see how the critical properties of classical fluids can in fact be modeled 
by an appropriate lattice gas Ising model, and we'll derive methods for describing the liquid-gas phase transition 
in such a model. 
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6.3.1 The configuration integral 

Consider the ordinary canonical partition function for a nonideal system of identical point particles interacting via 
a central two-body potential u(r). We work in the ordinary canonical ensemble. The TV-particle partition function 
is 

N 



Z(T,V,N) = ±[l[^Ae-^ 

4 — 1 



(6.52) 



Here, we have assumed a many body Hamiltonian of the form 

N 2 

# = £f|; + £«(!*< (6-53) 

i=l i<j 



in which massive nonrelativistic particles interact via a two-body central potential. As before, A T = yj2nh 2 /mk B T 
is the thermal wavelength. We can now write 

Z(T,V,N) = X r Nd Q N {T,V), (6.54) 

where the configuration integral Q N (T, V) is given by 

QnF, V) = ± ld d x x ■■■ jd d x N \{ e-^«) . (6.55) 

i<j 

There are no general methods for evaluating the configurational integral exactly. 



6.3.2 One-dimensional Tonks gas 

The Tonks gas is a one-dimensional generalization of the hard sphere gas. Consider a one-dimensional gas of 
indistinguishable particles of mass m interacting via the potential 



u(x — x 1 ) 



oo if \x — x'\ < a 
ti\x-x'\>a 



(6.56) 



Let the gas be placed in a finite volume L. The hard sphere nature of the particles means that no particle can get 
within a distance j a or the ends at x — and x = L. That is, there is a one-body potential v(x) acting as well, 
where 



v(x) 



oo if x < 

if \a < x < L - \a 
oo if x > L — \a . 



(6.57) 



The partition function of the ID Tonks gas is given by 

L 



Z(T,L,N) 



A 



— N 



dx-i 



dxjy \{x-^ , 



(6.58) 
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where X = e~ l 1 l k ^ T is zero if any two 'rods' (of length a) overlap, or if any rod overlaps with either boundary at 
x = and x = L, and X = 1 otherwise. Note that X does not depend on temperature. Without loss of generality 
we can integrate over the subspace where x 1 < x 2 < ■ ■ ■ < x N and then multiply the result by AH . Clearly Xj must 
lie to the right of Xj_ 1 + a and to the left of Y } ■ = L — (N — j)a — \a. Thus, 



1 1 1 2 N 

Z(T, L, N) = N J dx 1 Jdx 2 ■■ J dxj 



a/2 ij+o x N1 +a 

J dx 1 j ' dx 2 ■ ■ ■ J dx N _ x (Yjf_i — x N _ 1 ) 

a/2 ij+n x N _ 2 +a 
Y\ ^2 - 2 



\ T N J dx 1 J dx 2 ■ ■ J dx N _ 2 \ {Y 1 



-2 — x N-i) 



a/2 x ± +a x N _ 3 +a 



N-3~ 

A T (v 1„\ N _ A T IT AT„\N 



{X 1 -^ay y =^-(L-Na) N . (6.59) 



TV! v 1 ' Nl 
The \$ factor comes from integrating over the momenta; recall At = \j2ith 2 /mk B T . 

The free energy is 




(6.60) 



where we have used Stirling's rule to write In TV! ps N In N — N. The pressure is 

dF k B T nk B T 

P = -jrr = ~ = i . 6 - 61 ) 

oL j± - a 1 — na 

where n — N/L is the one-dimensional density. Note that the pressure diverges as n approaches 1/a. The usual 
one-dimensional ideal gas law, pL = Nk B T, is replaced by pL cS = Nk B T, where L cS = L — Na is the 'free' volume 
obtained by subtracting the total 'excluded volume' Na from the original volume L. 



6.3.3 Mayer cluster expansion 

Let us return to the general problem of computing the configuration integral. Consider the function eT^ Ui i , where 
ity = ui\x { — Xj\). We assume that at very short distances there is a strong repulsion between particles, i.e. — ► oo 
as r { j = \xj — Xj\ — > 0, and that — > as — > oo. Thus, e~^"« vanishes as r,^ — > and approaches unity as 
r-y — ► oo. For our purposes, it will prove useful to define the function 

f(r) = e~ pu{r) - 1 , (6.62) 
called the Mayer function after Josef Mayer. We may now write 

Qn(T, V) = ± [d d x 1 ---f d d x N J] (1 + /,.) . (6.63) 

i<j 

A typical potential we might consider is the semi-phenomenological Lennard-J ones potential, 




(6.64) 
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Figure 6.5: Bottom panel: Lennard-Jones potential u(r) = 4e (x 12 — x 6 ), with x = rja and e = 1. Note the weak 
attractive tail and the strong repulsive core. Top panel: Mayer function f(r,T) = e ~ u i r )/ k ^ T — \ f or k B T = 0.8 e 
(blue), fc B T = 1.5 e (green), and fc B T = 5 e (red). 

This accounts for a long-distance attraction due to mutually induced electric dipole fluctuations, and a strong 
short-ranged repulsion, phenomenologically modelled with a r~ 12 potential, which mimics a hard core due to 
overlap of the atomic electron distributions. Setting u'(r) = we obtain r* = 2 1 / 6 a ss 1.12246 a at the minimum, 
where u(r*) = — e. In contrast to the Boltzmann weight e~^ u ^ r \ the Mayer function fir) vanishes as r — > oo, 
behaving as /(r) ~ —j3u{r). The Mayer function also depends on temperature. Sketches of u(r) and f(r) for the 
Lennard-Jones model are shown in fig. 6.5. 

The Lennard-Jones potential 4 is realistic for certain simple fluids, but it leads to a configuration integral which is 
in general impossible to evaluate. Indeed, even a potential as simple as that of the hard sphere gas is intractable in 
more than one space dimension. We can however make progress by deriving a series expansion for the equation of 
state in powers of the particle density. This is known as the virial expansion. As was the case when we investigated 
noninteracting quantum statistics, it is convenient to work in the grand canonical ensemble and to derive series 
expansions for the density n(T, z) and the pressure p(T, z) in terms of the fugacity z, then solve for z(T, n) to 
obtain p(T, n). These expansions in terms of fugacity have a nifty diagrammatic interpretation, due to Mayer. 

We begin by expanding the product in eqn. 6.63 as 

11(1 / y ) =1 + ^4+ £ f iS /« + .... (6.65) 

As there are ^N(N — 1) possible pairings, there are 2 N ( N ~ 1 ^ 2 terms in the expansion of the above product. Each 
such term may be represented by a graph, as shown in fig. 6.7. For each such term, we draw a connection 
between dots representing different particles i and j if the factor / 4 • appears in the term under consideration. The 
contribution for any given graph may be written as a product over contributions from each of its disconnected 

4 Disambiguation footnote: Take care not to confuse Philipp Lenard (Hungarian-German, cathode ray tubes, Nazi), Alfred-Marie Lienard 
(French, Lienard- Wiechert potentials, not a Nazi), John Lennard-Jones (British, molecular structure, definitely not a Nazi), and Lynyrd Skynyrd 
(American, "Free Bird", possibly killed by Nazis in 1977 plane crash). I thank my colleague Oleg Shpyrko for setting me straight on this. 
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Figure 6.6: Left: John Lennard-Jones. Center: Catherine Zeta-Jones. Right: James Earl Jones. 
(/l,4 A, 7 A, 9 (^2,5 ^2,6) (/3,lo) (/8,ll) 



1 4 5 2 6 



2* • • • • 

s „ 
•— — • • • • 



9 7 3 10 12 13 14 

Figure 6.7: Diagrammatic interpretation of a term involving a product of eight Mayer functions. 



component clusters. For example, in the case of the term in fig. 6.7, the contribution to the configurational integral 
would be 



(6.66) 



AQ — — Jd d x 1 d d x 4 d d x 7 d d x 9 / 14 f 4 7 f 4 9 f 7 9 

X 1^ X2 ^ X ^ ^ X ^ ^ 2 ' 5 ^ 2 ' 6 X 1^ Xs ^ Xw ^ 3 ' 10 X J ^ X& ^ Xl1 ^ 8 ' 11 ' 

We will refer to a given product of Mayer functions which arises from this expansion as a term. 

The particular labels we assign to each vertex of a given graph don't affect the overall value of the graph. Now 
a given unlabeled graph consists of a certain number of connected subgraphs. For a system with N particles, we 
may then write 

iV = ^m 7 n 7 , (6.67) 

7 

where 7 ranges over all possible connected subgraphs, and 

m 7 = number of connected subgraphs of type 7 in the unlabeled graph 
n 1 = number of vertices in the connected subgraph 7 . 

Note that the single vertex • counts as a connected subgraph, with n, = 1. We now ask: how many ways are 
there of assigning the N labels to the N vertices of a given unlabeled graph? One might first thing the answer is 
simply N\, however this is too big, because different assignments of the labels to the vertices may not result in a 
distinct graph. To see this, consider the examples in fig. 6.8. In the first example, an unlabeled graph with four 
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vertices consists of two identical connected subgraphs. Given any assignment of labels to the vertices, then, we 
can simply exchange the two subgraphs and get the same term. So we should divide N\ by the product fT m 7 L 
But even this is not enough, because within each connected subgraph 7 there may be permutations which leave 
the integrand unchanged, as shown in the second and third examples in fig. 6.8. We define the symmetry factor 
s 7 as the number of permutations of the labels which leaves a given connected subgraphs 7 invariant. Examples 
of symmetry factors are shown in fig. 6.9. Consider, for example, the third subgraph in the top row. Clearly one 
can rotate the figure about its horizontal symmetry axis to obtain a new labeling which represents the same term. 
This twofold axis is the only symmetry the diagram possesses, hence s 7 = 2. For the first diagram in the second 
row, one can rotate either of the triangles about the horizontal symmetry axis. One can also rotate the figure in the 
plane by 180° so as to exchange the two triangles. Thus, there are 2x2x2 = 8 symmetry operations which result 
in the same term, and s 7 = 8. Finally, the last subgraph in the second row consists of five vertices each of which is 
connected to the other four. Therefore any permutation of the labels results in the same term, and s 7 = 5! = 120. 

In addition to dividing by the product Yl m -y !/ we must then also divide by J| 7 s™ 7 . 
We can now write the partition function as 

{rn T }ll TO 7 !s 7 7 \ i<3 J 

where the last product is over all links in the subgraph 7. The final Kronecker delta enforces the constraint 

N = J2-f m 7 n 7 - We next define the cluster integral b 1 as 

b y (T) = A • I fd\ ■ ■ ■ d d x^ n 4. . (6.69) 

7 i<j 

Since = f(\x i — x-\), the product YLl<j hj * s invariant under simultaneous translation of all the coordinate 
vectors by any constant vector, and hence the integral over the n y position variables contains exactly one factor of 
the volume, which cancels with the pref actor in the above definition of & 7 . Thus, each cluster integral is intensive, 
scaling as V^ . 5 

If we compute the grand partition function, then the fixed N constraint is relaxed, and we can do the sums: 



00 

7 m 7 =0 T 

cxp (V]>>^A^r^7 



Thus, 

n(T, v, n) = -vk B r (e"" \T d Y~' M T ) > ( 6 - 71 ) 

7 

and we can write 

P = k B Tj2{^ T d ) n ''M T ) (6-72) 



n 

1 



^n 7 (zA^)"^ 7 (T), (6.73) 



5 We assume that the long-ranged behavior of f(r) « —f3u(r) is integrable. 
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Figure 6.8: Different assignations of labels to vertices may not result in a distinct term in the expansion of the 
configuration integral. 
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Figure 6.9: The symmetry factor s 7 for a connected subgraph 7 is the number of permutations of its indices which 
leaves the term flfa-)^ /« invariant. 



where z = exp(/3/i) is the fugacity and where b t = 1. As we did in the case of ideal quantum gas statistical 
mechanics, we can systematically invert the relation n = n(z, T) to obtain z = z(n, T), and then insert this into the 
equation for p(z, T) to obtain the equation of state p = p(n, T). This yields the virial expansion of the equation of 
state, 



p = nfc B r{l + B 2 (T) n + B 3 (T) n 2 + . . . } 



(6.74) 
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6.3 A Cookbook recipe 

Just follow these simple steps! 

• The pressure and number density are written as an expansion over unlabeled connected clusters 7, viz. 

7 
7 

• For each term in each of these sums, draw the unlabeled connected cluster 7. 

• Assign labels 1,2, ... , n 1 to the vertices, where n 1 is the total number of vertices in the cluster 7. It doesn't 
matter how you assign the labels. 

• Write down the product JJ17< j fij ■ ^ ne f actor fij appears in the product if there is a link in your (now labeled) 
cluster between sites i and j. 

• The symmetry factor s 7 is the number of elements of the symmetric group S n which leave the product 
n7<j fij invariant. The identity permutation always leaves the product invariant, so s 7 > 1. 

• The cluster integral is 

MT) = -4/<*V"dVn/«- 

Due to translation invariance, bAT) oc V°. One can therefore set x n = 0, eliminate the volume factor from 
the denominator, and perform the integral over the remaining n 1 — 1 coordinates. 

• This procedure generates expansions for p(T, z) and n(T, z) in powers of the fugacity z = e' 3 ' 1 . To obtain 
something useful like p(T, n), we invert the equation n = n(T, z) to find z = z(T, n), and then substitute into 
the equation p = p(T, z) to obtain p = p(T, z(T, n)J = p(T, n). The result is the virial expansion, 

p = nk B Til + B 2 (T) n + B 3 (T) n 2 + . . . j . 

6.3.5 Lowest order expansion 

We have 

b-(T) = ± Jd d Xl fd d x 2 f(\ Xl - x 2 \) 
= i Jdbfir) 

and 

b A (T) = ± Jd d Xl Jd d x 2 Jd d x 3 f(\ Xl - x 2 \) f(\ Xl - x 3 \) 
= \ Jd^JdV f(r)f(r') = 2{b_) 2 



(6.75) 



(6.76) 
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and 



b A (T) = ± Jd d Xl Jd d x 2 Jd d x 3 /(K - x 3 |) f{\ Xl - x 3 \) f(\x 2 - x 3 \) 
fd d r fd d r' f(r)f(r')f(\r-r'\) . 



-n- 2b_n 2 + (2b 2 _ - 3b A ) n 3 + {n- 2b_n 2 ) 2 b_ + n 3 {2b 2 _ + b A ) + 0(n 4 ) 



k B T 

= n-b_n 2 - 2b A n 3 + 0{n i ) . 

Thus, 



(6.77) 



We may now write 

p = k B T^zX T d + (z\ T d ) 2 b_{T) + {z\^ d f ■ (6 A + b A ) + G(z 4 )} (6.78) 

n = z\ T d + 2(zX T d ) 2 b_(T) + 3(zX T d ) 3 ■ (b A + b A ) + 0{z 4 ) (6.79) 

We invert by writing 

zX T d = n + a 2 n 2 + a 3 n 3 + ... (6.80) 
and substituting into the equation for n(z, T), yielding 

n = (n + a 2 n 2 + a 3 n 3 ) + 2(n + a 2 n 2 ) 2 b_ + 3n 3 (b A + b A ) + 0(n 4 ) . (6.81) 

Thus, 

= (a 2 + 2b_) n 2 + (a 3 + 4a 2 b_ + 36 A + 36 A ) n 3 + ... . (6.82) 

We therefore conclude 

a 2 = -2b_ (6.83) 
a 3 = —4a 2 b_ — 36 A — 36 A 

= 8b 2 _ - 6b 2 _ - 3b A (6.84) 

= 2b 2 _ - 3b A . 

We now insert eqn. 6.80 with the determined values of a 2 3 into the equation for p(z, T), obtaining 



(6.85) 



B 2 (T) = -b_{T) , B 3 (T) = -2b A (T) . (6.86) 

6.3.6 Hard sphere gas in three dimensions 

The hard sphere potential is given by 

u(r) = { ~ (6.87) 
10 if r > a . 

Here a is the diameter of the spheres. The corresponding Mayer function is then temperature independent, and 
given by 

m = h * r l° (6.88) 
10 if r > a . 

We can change variables 

b_(T) = \td 3 r f(r) = -§7ra 3 . (6.89) 
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Figure 6.10: The overlap of hard sphere Mayer functions. The shaded volume is V. 

The calculation of 6 A is more challenging. We have 

b A = ±[d 3 p[d 3 rf(p)f(r)f(\r-p\) . 



(6.90) 

We must first compute the volume of overlap for spheres of radius a (recall a is the diameter of the constituent hard 
sphere particles) centered at and at p: 



(6.91) 



V= d 3 rf(r)f(\r-p\) 



= 2Jdz n{a 2 - z 2 ) = ^-a A - ira 2 p + ^ p J 

p/2 

We then integrate over region \p\ < a, to obtain 



b A = -\-^jdpp 2 - {f a 3_™ 2 p +^p 3 } 



¥« 6 



Thus, 



p = nk B T jl + ^a 3 n + ^-a 6 n 2 + C(n 3 )| 



(6.92) 



(6.93) 



6.3.7 Weakly attractive tail 

Suppose 

Then the corresponding Mayer function is 



u(r) = 



fir) 



oo if r < a 

—u (r) if r > a . 

— 1 if r < a 

e f3u (r)_ 1 if r>0 . 



(6.94) 



(6.95) 
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Thus, 

b -( T ) = l ld 3 rf{r) = -^a 3 + 2n jdrr 2 e""»< r »-l . 
Thus, the second virial coefficient is 

oo 

B 2 (T) = -b_(T) « f a 3 - Jdrr 2 u (r) , (6.97) 

a 

where we have assumed fc B T <c u (r). We see that the second virial coefficient changes sign at some temperature 
T , from a negative low temperature value to a positive high temperature value. 

6.3.8 Spherical potential well 

Consider an attractive spherical well potential with an infinitely repulsive core, 

{oo if r < a 
-e ifa<r<R (6.98) 
if r > R . 



Then the corresponding Mayer function is 



1 if r < a 

f(r) = { e P e -l if a < r < R (6.99) 
if r > R . 



Writing s = R/ a, we have 



B 2 (T) = -b_(T) = -\\d\f(r) 



1 



= "a \ ■ f a + ( e " X ) ' f « ( s ~ !) f 1 (6-100) 



fa 3 |l-( S 3 -l)(e^-l)} 



To find the temperature T where B 2 (T) changes sign, we set B 2 (T ) = and obtain 

s 3 '- 1 



fc B T = 6/ In I -J—) . (6.101) 



Recall in our study of the thermodynamics of the Joule-Thompson effect in §1.10.6 that the throttling process is 
isenthalpic. The temperature change, when a gas is pushed (or escapes) through a porous plug from a high pressure 
region to a low pressure one is 

P2 

'dr\ 



AT= ldp[—j , (6.102) 



Pi 
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u(r) 



a R a r 

Figure 6.11: An attractive spherical well with a repulsive core u(r) and its associated Mayer function /(r). 



where 



dT\ _L_ 

dp ) H C p 



(6.103) 



Appealing to the virial expansion, and working to lowest order in corrections to the ideal gas law, we have 



N N 2 
P=yKT+y^KTB 2 {T) 



and we compute (f^) by seting 



Nk T Nk 9N 2 N 2 

—£r d v + ^dT- — fc B^ 2 (r) + — d{k B TB 2 (T)) 



V 2 



V 



V 2 



(6.104) 



(6.105) 



Dividing by dT, we find 



1^1 



dB 2 

T ii- B2 



(6.106) 



The temperature where changes sign is called the inversion temperature T* . To find the inversion point, we 

set T* B' 2 {T*) = B 2 {T*),i.e. 

din B 



dlnT 

If we approximate B 2 (T) m A — y , then the inversion temperature follows simply: 



T* 



2B 
~A 



(6.107) 



(6.108) 



6.3.9 Hard spheres with a hard wall 



Consider a hard sphere gas in three dimensions in the presence of a hard wall at z = 0. The gas is confined to the 
region z > 0. The total potential energy is now 



W{x 1 , ... ,x N ) = Y^ vfa) + u ( x i - x 3 ) 



l< 3 



where 



. . I oo if z < \a 

v(r)=v{z) = i 2 

10 II Z > J a : 



(6.109) 



(6.110) 
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Figure 6.12: In the presence of a hard wall, the Mayer sphere is cut off on the side closest to the wall. The resulting 
density n(z) vanishes for z < |a since the center of each sphere must be at least one radius (\a) away from the 
wall. Between z = \a and z = |a there is a density enhancement. If the calculation were carried out to higher 
order, n(z) would exhibit damped spatial oscillations with wavelength A ~ a. 



and u(r) is given in eqn. 6.87. The grand potential is written as a series in the total particle number N, and is 
given by 

= ! + £ [d 3 r e-^« + ±£ 2 [d 3 r IdV e~ 0v ^ e ~^-r') + _ _ _ ^ (g m) 



where £, = z A T 3 , with z = e^-I^T ^ e fugacity. Taking the logarithm, and invoking the Taylor series ln(l + 5) 
. ., we obtain 



6-^ + 15* 



-(3t2 = £, d 3 r + 



\efd\ Jd\' 



-f}u(r—r') 



(6.112) 



The volume is V = J d?r. Dividing by V, we have, in the thermodynamic limit, 

z>0 



(in 



I3p = £, + U 2 i- ld\ /d¥ 



A' 



-0u(r-r') _ ^ 



z>% z'>i 



The number density is 



(6.113) 



(6.114) 



and inverting to obtain and then substituting into the pressure equation, we obtain the lowest order virial 
expansion for the equation of state, 

p = fc B Tjn+§7m 3 ?i 2 + ... j . (6.115) 
As expected, the presence of the wall does not affect a bulk property such as the equation of state. 
Next, let us compute the number density n(z), given by 

n(z) = (J2Hr-r i )). (6.116) 
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Due to translational invariance in the (x, y) plane, we know that the density must be a function of z alone. The 
presence of the wall at z = breaks translational symmetry in the z direction. The number density is 



n(z) = Tr 



A' 



Tr e P(^-H) 



(6.117) 



£ e -0v(z) e -Pv(z) d 3 r l e -l3v(z') \ ~fiu(r-r') _ j 



Note that the term in square brackets in the last line is the Mayer function fir — r') = e ^ U ^ T r ' — 1. Consider 
the function 



-fSv(z) p -pv(z') 



f(r - r') = 



if z < |a or z' < 



> a 




(6.118) 



> |a and z' > and \r — r'\ < a 



Now consider the integral of the above function with respect to r'. Clearly the result depends on the value of z. 
If z > | a, then there is no excluded region in r' and the integral is (—1) times the full Mayer sphere volume, i.e. 
— |7ra 3 . If z < ia the integral vanishes due to the e~^ v ^ factor. For z infinitesimally larger than |a, the integral 



is (—1) times half the Mayer sphere volume, i.e. — |7ra 3 . For z € [| , 4f] the integral interpolates between 
and — |7ra 3 . Explicitly, one finds by elementary integration, 



2 7TO 3 



dV e -* W e -MS) f[r _ r >) = [-1 - §(f - I) + \ (J - \) 



-|™ 3 



!™ 3 



if z < \a 
if \a < z < 
if z > |a . 



(6.119) 



After substituting £ 



■na n + 0(n ) to relate £ to the bulk density n = n^, we obtain the desired result: 



n(z) 




3f£ 
2\a 



V 3 



|7ra 3 n 2 



if z < ia 

if |a < z < |a 

if z > |a . 



(6.120) 



A sketch is provided in the right hand panel of fig. 6.12. Note that the density n(z) vanishes identically for z < | 
due to the exclusion of the hard spheres by the wall. For z between |a and |a, there is a density enhancement, 
the origin of which has a simple physical interpretation. Since the wall excludes particles from the region z < \, 
there is an empty slab of thickness hz coating the interior of the wall. There are then no particles in this region to 
exclude neighbors to their right, hence the density builds up just on the other side of this slab. The effect vanishes 
to the order of the calculation past z = |a, where n(z) = n returns to its bulk value. Had we calculated to higher 
order, we'd have found damped oscillations with spatial period A ~ a. 
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6.4 Lee- Yang Theory 

6.4.1 Analytic properties of the partition function 

How can statistical mechanics describe phase transitions? This question was addressed in some beautiful mathe- 
matical analysis by Lee and Yang 6 . Consider the grand partition function H, 

oo 

E(T,V,z)=J2 zN Qn(T,V)X^ n , (6.121) 

N=0 

where 

Qn(T,V) = ^ fd d Xl --- Jd d x N e- u ^ x "V k * T (6.122) 

is the contribution to the TV-particle partition function from the potential energy U (assuming no momentum- 
dependent potentials). For two-body central potentials, we have 

U(x 1 ,...,x N ) =^«(|a; i -a5 i |) . (6.123) 

Suppose further that these classical particles have hard cores. Then for any finite volume, there must be some 
maximum number N v such that Q N (T, V) vanishes for N > N v . This is because if N > N v at least two spheres 
must overlap, in which case the potential energy is infinite. The theoretical maximum packing density for hard 
spheres is achieved for a hexagonal close packed (HCP) lattice 7 , for which / HCP = ^= = 0.74048. If the spheres 
have radius r , then N v = V/ 4 \/2tq is the maximum particle number. 

Thus, if V itself is finite, then 3(T, V, z) is a finite degree polynomial in z, and may be factorized as 

N v N v 

S(T, V, z) = £ z w Qn(T, V) \? dN = J] 1 " - . (6-124) 

7V=0 k=l ^ Zk ' 

where z k (T, V) is one of the N v zeros of the grand partition function. Note that the O(z ) term is fixed to be unity. 
Note also that since the configuration integrals Q N (T, V) are all positive, S(z) is an increasing function along the 
positive real z axis. In addition, since the coefficients of z N in the polynomial 5(z) are all real, then S(z) = 
implies S(z) = = 0, so the zeros of S(z) are either real and negative or else come in complex conjugate pairs. 

For finite N v , the situation is roughly as depicted in the left panel of fig. 6.13, with a set of N v zeros arranged in 
complex conjugate pairs (or negative real values). The zeros aren't necessarily distributed along a circle as shown 
in the figure, though. They could be anywhere, so long as they are symmetrically distributed about the Re(z) axis, 
and no zeros occur for z real and nonnegative. 

Lee and Yang proved the existence of the limits 

/-= Urn llnSCT.V.z) (6.125) 



v d 
lim z — 

V-*oo dz 



i lnZ(T,V,z) 



(6.126) 



6 See C. N. Yang and R. D. Lee, Phys. Rev. 87, 404 (1952) and ibid, p. 410 

7 See e.g. http://en.wikipedia.org/wiki/Close-packing . For randomly close-packed hard spheres, one finds, from numerical simulations, 
!CP = 0.644. 
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Figure 6.13: In the thermodynamic limit, the grand partition function can develop a singularity at positive real 
fugacity z. The set of discrete zeros fuses into a branch cut. 



and notably the result 

" = (6127) 

which amounts to the commutativity of the thermodynamic limit V — > oo with the differential operator z . In 
particular, p(T, z) is a smooth function of z in regions free of roots. If the roots do coalesce and pinch the positive 
real axis, then then density n can be discontinuous, as in a first order phase transition, or a higher derivative 
d^p/dn^ can be discontinuous or divergent, as in a second order phase transition. 



6.4.2 Electrostatic analogy 

There is a beautiful analogy to the theory of two-dimensional electrostatics. We write 

1 N v / 



N v 

= -J2[<P(z-z k )-4,(0-z k )\, (6.128) 



fc=i 
where 

<t>(z) = -i ln(z) (6.129) 

is the complex potential due to a line charge of linear density A = V^ 1 located at origin. The number density is 
then 

8 / \ 1 Nv 



n = z 



to be evaluated for physical values of z, i.e. z £ R + . Since <j>(z) is analytic, 
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If we decompose the complex potential <p — <f> x + i<fi 2 into real and imaginary parts, the condition of analyticity is 
recast as the Cauchy-Riemann equations, 



Thus, 



dx dy dy dx ^ ^ 



dcj) 1 dcj) i dcj) 
~dz~ = ~2 fe + 2 ~dy~ 
l(d4> x dcj> 2 



2\dx dy J 2\dy dx J (6.133) 

dx dy 
= E X - zE y , 

where E = — 'V(p 1 is the electric field. Suppose, then, that as V — * oo a continuous charge distribution develops, 
which crosses the positive real z axis at a point x £ R + . Then 



-± = E x (x+) - E x {x-) = Ana(x) , (6.134) 

x 

where a is the linear charge density (assuming logarithmic two-dimensional potentials), or the two-dimensional 
charge density (if we extend the distribution along a third axis). 



6.4.3 Example 

As an example, consider the function 



(l + z) M (l-z M ) 
1 - z 

(l + z) M (l + z + z 2 + ... + z M - 1 ) . 



(6.135) 



The (2M — 1) degree polynomial has an M th order zero at z = — 1 and (M — 1) simple zeros at z = e 27rifc / M ; where 
k € {1, . . . , ill— 1}. Since M serves as the maximum particle number N v , we may assume that V = Mv , and the 
V — > oo limit may be taken as M — > oo. We then have 



— = lim — ln^,(z) 
k B T v^oo V y ' 

= — lim — In S(z) 

V Q M-*oo M 



(6.136) 



— lim — 

v M— >oo M 



Mln(l + z) + ln(l - z M ) - ln(l - z) 

The limit depends on whether \z\ > 1 or \z\ < 1, and we obtain 

'ln(l + z) 



P v o 
kT 



ln(l + z) + Inz 



if \z\ < 1 
if \z\ > 1 



(6.137) 
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Figure 6.14: Fugacity z and pvo/ k B T versus dimensionless specific volume v /vq for the example problem discussed 
in the text. 



Thus, 



d ( p 



dz \k B T 

If we solve for z(v), where v — n^ 1 , we find 



J_ . z 

V 1+2 



1+z 



if v > 2v n 



if \z\ < 1 
if \z\ > 1 



v «-' v if I„ < „ < 2,. 



(6.138) 



(6.139) 



We then obtain the equation of state, 



P v o 
KT 



In 



= < In 2 



11 (2v-v )- 



if v > 2u n 



if |u < v < 2v 



if < u < |w 



(6.140) 



6.5 Liquid State Physics 

6.5.1 The many-particle distribution function 

The virial expansion is typically applied to low-density systems. When the density is high, i.e. when na 3 ~ 1, 
where a is a typical molecular or atomic length scale, the virial expansion is impractical. There are to many terms 
to compute, and to make progress one must use sophisticated resummation techniques to investigate the high 
density regime. 



To elucidate the physics of liquids, it is useful to consider the properties of various correlation functions. These 
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objects are derived from the general iV-body Boltzmann distribution, 



'z^-j^e-? 11 "^ OCE 
f(x 1 ,...,x N ;p 1 ,...,p N ) = I (6.141) 



. j_ e (3nN e -t3H N (p,x) GCE _ 



We assume a Hamiltonian of the form 

N 



i=l 

The quantity 

,, d d x 1 d d Pl d d x N d d p N ttL-i A i\ 

f(x 1 ,...,x N ;p 1 ,...,p N ) — d — d (6.143) 

is the propability of finding N particles in the system, with particle #1 lying within d 3 x x oix 1 and having momen- 
tum within d d p 1 of p x , etc. If we compute averages of quantities which only depend on the positions {x^} and not 
on the momenta { P j}, then we may integrate out the momenta to obtain, in the OCE, 

P( Xl , ...,x N ) =Q- X ~ e - • ■■■>**) , (6.144) 
where W is the total potential energy, 

W(x 1 ,...,x N )=Y^v(x i ) + ^2u(x i -x j )+ w(x i -x j ,x j -x k ) + ... , (6.145) 

i i<j i<j<k 

and Q N is the configuration integral, 

Q N (T,V) = ± •••/'/•'• V iU ^ ''^ • (6.146) 

We will, for the most part, consider only two-body central potentials as contributing to W , which is to say we 
will only retain the middle term on the RHS. Note that P( Xl , . . . , x N ) is invariant under any permutation of the 
particle labels. 

6.5.2 Averages over the distribution 

To compute an average, one integrates over the distribution: 

(F(x 1 ,...,x N )) = Jd d x 1 --- Jd d x N P(x 1 , ... , x N )F(x 1 , ... ,x 



N I 



(6.147) 



The overall iV-particle probability density is normalized according to 

Jd d x N P(x 1: ...,x N ) = 1 . 

The average local density is 

i 

= N Jd d x 2 ■ ■ ■ Jd d x N P(r,x 2 , . . . ,x 



(6.148) 



(6.149) 



Nl ■ 
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Note that the local density obeys the sum rule 

ld d rn 1 (r) = N. (6.150) 



In a translationally invariant system, n 1 = n = is a constant independent of position. The boundaries of 
a system will in general break translational invariance, so in order to maintain the notion of a translationally 
invariant system of finite total volume, one must impose periodic boundary conditions. 

The two-particle density matrix n 2 (r 1 , r 2 ) is defined by 

n 2 (r 1 ,r 2 ) = (X^( r i _ x i) s ( r 2 - x j)) 

(6-151) 

= N(N -I) j <!%■■■ J d d x N P(r x , r 2) x 3 ,...,x N ). 

As in the case of the one-particle density matrix, i.e. the local density n 1 (r), the two-particle density matrix satisfies 
a sum rule: 

r d\[d%n 2 ( ri ,r 2 ) = N(N - 1) . (6.152) 

Generalizing further, one defines the k-particle density matrix as 

n k (ri,--.,r k ) = ( ^ 5(r 1 - x^) ■ ■ ■ 6(r k - x ik ) ) 

<1 '" < * (6.153) 



(JV-jfe)! 



Jd d x k+1 ■ ■ ■ Jd d x N P{r 1 , . . . , r k7 x k+1 , . . . , x N ) , 



where the prime on the sum indicates that all the indices i 1 ,...,i k are distinct. The corresponding sum rule is 
then 

fd\ ■ ■ ■ Jd d r k n k ( ri , ...,r k ) = . (6.154) 

The average potential energy can be expressed in terms of the distribution functions. Assuming only two-body 
interactions, we have 



(W) = (J2u(x i -x j )) 

i<j 

= 1 J d ^i [ d % u ( r i ~ r ^(J2 <^ r i - x i> ^ - x j)) (6.155) 



Vr-! U d r 2 u(r 1 -r 2 )n 2 {r ll r 2 ) . 



2 



As the separations =.\r i — | get large, we expect the correlations to vanish, in which case 



AM 



(6.156) 



:n 1 (r 1 )---n 1 (r k ) 



(N-k)\ N k 
= ( 1 -^)( 1 -|)-( 1 -^ 1 ) 
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The k-p 'article distribution function is defined as the ratio 



For large separations, then, 

5fe (r lJ ...,r fc )-— 111 1 -^)- ( 6 - 158 ) 



fc-i / 



3=1 



For isotropic systems, the two-particle distribution function g 2 (i'i, r 'i) depends only on the magnitude \r 1 — r 2 \. 
As a function of this scalar separation, the function is known as the radial distribution function: 



l(r) =g 2 {r) = -5 (^Sir-x^Sixj)) 



(6.159) 



Vn 2 

The radial distribution function is of great importance in the physics of liquids because 

• thermodynamic properties of the system can be related to g(r) 

• g(r) is directly measurable by scattering experiments 

For example, in an isotropic system the average potential energy is given by 

(W) = \L d\ [ d d r 2 u( ri -r 2 )n 2 (r, , r 2 ) 



\r?^d d r 1 /dSufo-ra^flri-raQ (6.160) 



2 

:2 



/ d\ u(r) g(r) 



2V. 

For a three-dimensional system, the average internal (i.e. potential) energy per particle is 

oo 

= 2im j r 2 g{r) u(r) . (6.161) 
o 

Intuitively, f(r) dr = A-kt 2 n g(r) dr is the average number of particles lying at a radial distance between r and 
r + dr from a given reference particle. The total potential energy of interaction with the reference particle is then 
f{r) u(r) dr. Now integrate over all r and divide by two to avoid double-counting. This recovers eqn. 6.161. 



In the OCE, g(r) obeys the sum rule 



d 4 rg(r) = ^ 2 --N(N-l) = V~^, (6.162) 



hence 

n Jdfr [g(r) - l] = -1 (OCE) . (6.163) 
The function h(r) = g(r) - 1 is called the pair correlation function. 
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36 



Ar at 85 K 
= 0.02125 /A 3 




Figure 6.15: Pair distribution functions for hard spheres of diameter a at filling fraction r\ = n = 0.49 (left) 
and for liquid Argon at T = 85 K (right). Molecular dynamics data for hard spheres (points) is compared with 
the result of the Percus-Yevick approximation (see below in §6.5.8). Reproduced (without permission) from J. -P. 
Hansen and I. R. McDonald, Theory of Simple Liquids, fig 5.5. Experimental data on liquid argon are from the 
neutron scattering work of J. L. Yarnell et ah, Phys. Rev. A 7, 2130 (1973). The data (points) are compared with 
molecular dynamics calculations by Verlet (1967) for a Lennard-Jones fluid. 



In the grand canonical formulation, we have 



n d 3 rh(r) = 



(N) 
V 



(N(N-l)) 



V -V 



W - « 

(N) 



= nk B Tn T — 1 



(6.164) 



(GCE) 



where k t is the isothermal compressibility. Note that in an ideal gas we have h(r) = and n T = = l/nk B T. 
Self-condensed systems, such as liquids and solids far from criticality are nearly incompressible, hence < 
nk B T k t <C 1, and therefore nfcfirh(r) ~ —1. For incompressible systems, where k t = 0, this becomes an 
equality. 

As we shall see below in §6.5.4, the function h(r), or rather its Fourier transform h(k), is directly measured in 
a scattering experiment. The question then arises as to which result applies: the OCE result from eqn. 6.163 or 
the GCE result from eqn. 6.164. The answer is that under almost all experimental conditions it is the GCE result 
which applies. The reason for this is that the scattering experiment typically illuminates only a subset of the entire 
system. This subsystem is in particle equilibrium with the remainder of the system, hence it is appropriate to use 
the grand canonical ensemble. The OCE results would only apply if the scattering experiment were to measure 
the entire system. 
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H 2 pair distributions 




r 



A 



Figure 6.16: Monte Carlo pair distribution functions for liquid water. From A. K. Soper, Chem Phys. 202, 295 
(1996). 

6.5.3 Virial equation of state 

The virial of a mechanical system is defined to be 



i 

where F i is the total force acting on particle i. If we average G over time, we obtain 

T 



T 



(6.165) 



(6.166) 



lim 



Here, we have made use of 



= -3NKT 



X:-F: = mXi- ib: = — mxf H ( m x 4 ■ x 

at 



(6.167) 



as well as ergodicity and equipartition of kinetic energy We have also assumed three space dimensions. In a 
bounded system, there are two contributions to the force F i . One contribution is from the surfaces which enclose 
the system. This is given by 8 



(G) s 



rfaccs 



(E 



x, • F- 



(surf) 



) = SpV . 



(6.168) 



8 To derive this expression, note that i^( surf ) is directed inward and vanishes away from the surface. Each Cartesian direction a = (x, y, z) 

then contributes — Fa Surf ' L a , where L a is the corresponding linear dimension. But F^^^ = pA a , where A a is the area of the corresponding 
face and p. is the pressure. Summing over the three possibilities for a, one obtains eqn. 6.168. 



266 CHAPTER 6. CLASSICAL INTERACTING SYSTEMS 



The remaining contribution is due to the interparticle forces. Thus, 

B B i 

Invoking the definition of g(r), we have 



p = nk B T < 1 



2im 



dr r 3 g{r) u (r) 



(6.169) 



(6.170) 



As an alternate derivation, consider the First Law of Thermodynamics, 

df} = -SdT -pdV - Ndfi , 

from which we derive 

P = 



W). 

Now let V ^ £ 3 V, where £ is a scale parameter. Then 



dVJ, 



T.N 



dfl _ Id 
&V ~ ~W d£ 



n{T,e 3 v,iJL) 



Now 



N=0 



Thus, 



1 df2{t 3 V) 



W d£ 
KT 1 



i=i 



k B T 1 d~(£ 3 V) 
3V S d£ 



3V 

nkJT 



1 at n ' I ** J 



\ nG -I3W( Xi ,..., Xn ) 



N=0 

1 idW 
W \~di/e=i 
Finally, from W = Y^i<j u {^ x ij) we have 



dW 



~d£/t=i 



E^y ' Vnfiij) 



•<3 



27riV' 



dr r 3 <?(r) u'(r) , 



and hence 



r 



p = nk B T — §7to. 2 / dr r 3 g{r) u'(r) . 



(6.171) 
(6.172) 

(6.173) 



(6.174) 



(6.175) 



(6.176) 



(6.177) 



Note that the density n enters the equation of state explicitly on the RHS of the above equation, but also implicitly 
through the pair distribution function g(r), which has implicit dependence on both n and T. 
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Figure 6.17: In a scattering experiment, a beam of particles interacts with a sample and the beam particles scatter 
off the sample particles. A momentum hq and energy hio are transferred to the beam particle during such a 
collision. If u = 0, the scattering is said to be elastic. For w^O, the scattering is inelastic. 



6.5.4 Correlations and scattering 



Consider the scattering of a light or particle beam (i.e. photons or neutrons) from a liquid. We label the states of 
the beam particles by their wavevector k and we assume a general dispersion e fc . For photons, e fc = ftc|fc|, while 

for neutrons e k = ■ We assume a single scattering process with the liquid, during which the total momentum 
and energy of the liquid plus beam are conserved. We write 

k' = k + q (6.178) 
e k , =e k + hw , (6.179) 

where k! is the final state of the scattered beam particle. Thus, the fluid transfers momentum Ap = hq and energy 
hw to the beam. 

Now consider the scattering process between an initial state \i,k) and a final state | j, kl ), where these states 
describe both the beam and the liquid. According to Fermi's Golden Rule, the scattering rate is 

r ik ^ jk > = ^-\(hk'\V\i,k)\ 2 5^- E i + hu) , (6.180) 

where V is the scattering potential and E i is the initial internal energy of the liquid. If r is the position of the beam 
particle and {x t } are the positions of the liquid particles, then 

N 

V(r) = 5>(r - *,) ■ (6-181) 

i=i 

The differential scattering cross section (per unit frequency per unit solid angle) is 
where 

9{e) = f^S^e-e h ) (6.183) 
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is the density of states for the beam particle and 



P i = i c -^. (6.184) 



Consider now the matrix element 

N 



(j,k'\V\i,k} = (j\±Yl [dtre^W v(r- Xl )\i) 

i i <* 



i=i 

1 N 

2=1 

where we have assumed that the incident and scattered beams are plane waves. We then have 

9 2 Q- h g( £ fc+g) \v(q)\ 2 \ - V^l/.l^ - iq . x . I .\|2 , 

97^ = 27^-^E^EIOIE e 'l'>l W-^+M 

-j|«(q)| 2 5( 9 ,cj) , 



4-|V fc e fc | ^ 2 



Thus, 



oo 

OO 



(6.185) 



(6.186) 



where w) is the dynamic structure factor, 

9 h N 
S(q, u>) = -^ Yl P i E I ( 3 I E | 1' ^ - ^ + M ( 6 - 187 ) 

Note that for an arbitrary operator A, 

oo 

^2\(j\A\i)\ 2 S(E j -E i + nw) = ^Yl jdte^~ E ^^ {i\A^\j){j\A\i) 

3 3 — oo 

oo 

= _L £ /rffc c*"* < * I At I i > <j I e*^*/* Ae- <A */ h | i > 

— OO 

OO 

= 2^hJ dt ^ ^ I Af ^ A ^ I * ) • (6 ' 188) 



(6.189) 



where the angular brackets in the last line denote a thermal expectation value of a quantum mechanical operator. 
If we integrate over all frequencies, we obtain the equal time correlator, 

oo 

M' (6.190) 

= N6 qfi + l + n[d' i re- i i- r [g(r)-l] . 
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3 



i 1 1 1 

EXPERIMENTAL DATA 
T = 85K , n = 0.0213 /A 3 

MOLECULAR DYNAMICS 
T = 86.6K , 7i = 0.02138 /A 3 




12 



Figure 6.18: Comparison of the static structure factor as determined by neutron scattering work of J. L. Yarnell et 
ah, Phys. Rev. A 7, 2130 (1973) with molecular dynamics calculations by Verlet (1967) for a Lennard-Jones fluid. 



known as the static structure factor 9 . Note that S(q = 0) = N, since all the phases e iq '^ Xi ~ x ^ are then unity. As 
q — ► oo, the phases oscillate rapidly with changes in the distances \x i — xA, and average out to zero. However, the 
'diagonal' terms in the sum, i.e. those with i = j, always contribute a total of 1 to S(q). Therefore in the q — > oo 
limit we have S(q — > oo) = 1. 

In general, the detectors used in a scattering experiment are sensitive to the energy of the scattered beam particles, 
although there is always a finite experimental resolution, both in q and u>. This means that what is measured is 
actually something like 

S meas (q,u;) = jd d q' [ dw' F(q - q') G(u - u>') S(q' ,u>') , (6.191) 

where F and G are essentially Gaussian functions of their argument, with width given by the experimental reso- 
lution. If one integrates over all frequencies u>, i.e. if one simply counts scattered particles as a function of q but 
without any discrimination of their energies, then one measures the static structure factor S(q). Elastic scattering 
is determined by S(q, ui = 0, i.e. no energy transfer. 

6.5.5 Correlation and response 



Suppose an external potential v(x) is also present. Then 

1 

v\ ' N\ 



P( Xl , ... ,x N ) = — l — . 1 -W^.-.^e-ZSEi"^) j ( 6 .i92) 



where 



Q N [v] - fd\--- f^e-^-'^e-^'W . (6.193) 



9 We may write S g Q = ± (2n) d S(q). 
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The Helmholtz free energy is then 

F = ~]n(x^ dN Q N [v}) . (6.194) 

Now consider the functional derivative 

SF 1 1 5Q N 



5v(r) (3 Q N 5v(r) 



Using 



hence 

SF 



Sv(r) 



which is the local density at r. 
Next, consider the response function, 

X(r,r') : 



Sn^r) 6 2 F[v] 



Taking the Fourier transform, 



We may also write 



-k B Tx(r - r') = -n 2 + nd{r - r') + n 2 g(\r - r'\) 
= n 2 h{\r - r'\) + n 5(r - r') . 

~k B Tx(q) =n + n 2 h(q) 
= nS{q). 



(6.195) 



]T v( Xi ) = fd^r v(r) Y, S(r -x,), (6.196) 
i J i 

Jd d x 1 ---Jd d x N P(x l7 ... , x N ) J2 S ( r - x i) 

ni (r) , (6.197) 



Sv(r') Sv(r) 5v(r') 

1 J_ SQ N SQ N _ 1 J_ S^jf (6.198) 
~ ' Q 2 N 5v{r) Sv{r>) ' Q N Sv(r) Sv{r>) 

= /3n 1 (r) n 1 (r') — /3n 1 (r) S(r — r') — n 2 (r, r') . 

In an isotropic system, X(r, r') = X(r — r') is a function of the coordinate separation, and 



(6.199) 



(6.200) 



£ = 1 + n h{0) = -nk B T X (0) , (6.201) 



i.e. Krp = — X(0). 

What does this all mean? Suppose we have an isotropic system which is subjected to a weak, spatially inhomoge- 
neous potential v(r). We expect that the density n(r) in the presence of the inhomogeneous potential to itself be 
inhomogeneous. The first corrections to the v = value n = n are linear in v, and given by 

Sn(r) = /dV X(r,r')v(r') 

J (6.202) 
= -0n o v(r) - 0n 2 o \ dV h{r - r) v{r') . 
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Note that if v(r ) > it becomes energetically more costly for a particle to be at r . Accordingly the density response 
is negative, and proportional to the ratio w(r)/fc B T - this is the first term in the above equation. If there were no 
correlations between the particles, then h = and this would be the entire story However, the particles in general 
are correlated. Consider, for example, the case of hard spheres of diameter a, and let there be a repulsive potential 
at r = 0. This means that it is less likely for a particle to be centered anywhere within a distance a of the origin. 
But then it will be more likely to find a particle in the next 'shell' of radial thickness a. 



6.5.6 BBGKY hierarchy 

The distribution functions satisfy a hierarchy of integro-differential equations known as the BBGKY hierarchy 10 . In 
homogeneous systems, we have 



N\ 1 



(N - k) 



•)! J ^k+i ' ' J d d x N P(r 1 , ... , r k , x k+1 , . . . , x N ) , 



(6.203) 



where 



r [Xl , ... ,x N )- Q ^ m e 



(6.204) 



Taking the gradient with respect to r x , we have 



d In 

g^rdkin,... , r k ) = — ■ fc) , 

d 
dr 1 



-,d d x k+1 ---ld d x N e-^*<^^ 



(6.205) 



where J2 k<i< j means to sum on indices i and j such that i < j and k < i, i.e. 



N-l N 

k<i<j i— fc+1 j=i+l 

k-1 k 

i<j<k z—1 j=i+l 

fc JV 

Z "fa" Z «( r i- !B j) • 

i<k<j i=l j=k+l 



Now 



g 
9r 1 



U<j<fe 1 k<j 1 



,-0 «(n 3 -) .e-PEKKi"^-^) 



(6.206) 



10 So named after Bogoliubov, Born, Green, Kirkwood, and Yvon. 
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hence 



Q * du(r, - r,-) 

— g k ( ri ,...,r k ) = L 3 9 k (n ,...,r k ) (6.207) 

1 j=2 1 

- p(N - k) Jd d x k+1 du{ri dr * k+l) P(r a ,...,r k , x k+1 ,...,x N ) 
J\ du(r, — r 4 ) 

= -f 3 H L gfc( r i ■ ■ ■ ■ .*•*) ( 6 - 208 ) 

J'=2 1 



+ njd x k+1 — 9k+iVi i ■ ■ ■ i r k i x k+i) 

Thus, we obtain the BBGKY hierarchy: 

, .. . , r k ) = J2 dU{r L~ rj) Skir, , . . . , r fc ) (6.209) 

1 j=2 1 

9^ n..r'). 

The BBGKY hierarchy is an infinite tower of coupled integro-differential equations, relating g k to g k+1 for all k. If 
we approximate g fc at some level k in terms of equal or lower order distributions, then we obtain a closed set of 
equations which in principle can be solved, at least numerically. For example, the Kirkzvood approximation closes 
the hierarchy at order k = 2 by imposing the condition 

9 3 ( r i , r 2 , r 3 ) = g(r 1 - r 2 ) g(r 1 - r 3 ) g(r 2 - r 2 ) . (6.210) 
This results in the single integro-differential equation 

-fc B T Vg(r) = g(r) Vu + nJ d d r' g(r) g{r') g(r - r') Vu(r - r') . (6.211) 

This is known as the Born-Green- Yvon (BGY) equation. In practice, the BGY equation, which is solved numerically, 
gives adequate results only at low densities. 



6.5.7 Ornstein-Zernike theory 

The direct correlation function c(r) is defined by the equation 

h(r) = c(r) + n [ d 3 r' h(r - r') c(r') , (6.212) 



where h(r) = g(r) — 1 and we assume an isotropic system. This is called the Ornstein-Zernike equation. The first 
term, c(r), accounts for local correlations, which are then propagated in the second term to account for long- 
ranged correlations. 

The OZ equation is an integral equation, but it becomes a simple algebraic one upon Fourier transforming: 

h(q)=c(q)+nh(q)c(q) , (6.213) 

the solution of which is 

Hi) = ; C(q \ ; ■ (6-214) 
1 — nc(q) 
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The static structure factor is then 



S(q) = 1 + n Mg) = — . (6.215) 

1 — n c{q) 



In the grand canonical ensemble, we can write 

l + nh(0) 1 1 



nk B T nk B T 1 — nc(0) 
where = l/nk B T is the ideal gas isothermal compressibility. 



nc(0) = l-^, (6.216) 



At this point, we have merely substituted one unknown function, h(r), for another, namely c(r). To close the 
system, we need to relate c(r ) to h(r) again in some way. There are various approximation schemes which do just 
this. 



6.5.8 Percus-Yevick equation 

In the Percus-Yevick approximation, we take 

c{r) = [1 - e 0u{r) ] ■ g{r) . (6.217) 

Note that c(r) vanishes whenever the potential u(r) itself vanishes. This results in the following integro-differential 
equation for the pair distribution function g(r): 

g{r) = e~^ r) +ne~ 0u ^ [dV [g(r - r') - l] • [l - e^ (r '>] g{r') . (6.218) 



This is the Percus-Yevick equation. Remarkably, the Percus-Yevick (PY) equation can be solved analytically for the 
case of hard spheres, where u(r) = oo for r < a and u(r) = for r > a, where a is the hard sphere diameter. Define 
the function y(r) = eP u ^ g(r), in which case 

c(r)=y(r)f(r) = r y{r) ' " ~ ° (6.219) 
10 , r > a . 

Here, /(r) = e~ l3ul - r ' > — 1 is the Mayer function. We remark that the definition of y(r) may cause some concern for 
the hard sphere system, because of the e 13 ' 11 ^ term, which diverges severely for r < a. However, g{r) vanishes in 
this limit, and their product y(r) is in fact finite! The PY equation may then be written for the function y(r) as 

y(r) = l + njd 3 r' y{r') -n J dV y(r') y(r - r') . (6.220) 

r'<a r' <a 

r— r' [>a 

This has been solved using Laplace transform methods by M. S. Wertheim, /. Math. Phys. 5, 643 (1964). The final 
result for c(r) is 

c(r) = - |a x + 6r7 A 2 (£j + \v Ai {£) 'j • 0(o - r) , (6.221) 

where r) = ^TTa 3 n is the packing fraction and 

(1 + 2^ _ (1 + H 2 f6222 , 

1 ~ TT^" ' A2 -~ (l-r,)4 ' (6 - 222) 

This leads to the equation of state 

l + i] + r] 2 
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quantity 


exact 


PY 


HNC 




0.28695 


0.2969 


0.2092 


B B /m 


0.1103 


0.1211 


0.0493 




0.0386 


0.0281 


0.0449 


B 7 /B« 


0.0138 


0.0156 





Table 6.1: Comparison of exact (Monte Carlo) results to those of the Percus-Yevick (PY) and hypernetted chains 
approximation (HCA) for hard spheres in three dimensions. Sources: Hansen and McDonald (1990) and Reichl 
(1998) 



This gets B 2 and B 3 exactly right. The accuracy of the PY approximation for higher order virial coefficients is 
shown in table 6.1. 

To obtain the equation of state from eqn. 6.221, we invoke the compressibility equation, 

nk « TK ^{W T = ^hm- (6 - 224) 

We therefore need 



:(0) = Jd 6 rc{r) 
l 

= -4na 3 J dxx 2 [\ x + 6 y A 2 x + \ ?y \ x x 3 ] {6225) 



= -Ana 3 [iAj + § 77 A 2 + j 2 ijX 1 ] ■ 
With rj = ^na 3 n and using the definitions of X 1 2 in eqn. 6.222, one finds 



1 + 4?? + 4?y 2 

1 - n c = — . (6.226) 

(1 - V) 



We then have, from the compressibility equation, 

6k B T dp 1 + 4?7 + 4?7 2 



(6.227) 



7ra 3 drj (1 — ?y) 4 

Integrating, we obtain p(rj) up to a constant. The constant is set so thatp = when n = 0. The result is eqn. 6.223. 
Another commonly used scheme is the hypernetted chains (HNC) approximation, for which 

c(r) = -/3u(r) + h(r) - ln(l + h{rfj . (6.228) 

The rationale behind the HNC and other such approximation schemes is rooted in diagrammatic approaches, 
which are extensions of the Mayer cluster expansion to the computation of correlation functions. For details and 
references to their application in the literature, see Hansen and McDonald (1990) and Reichl (1998). 



6.5.9 Ornstein-Zernike approximation at long wavelengths 



Let's expand the direct correlation function c(q) in powers of the wavevector q, viz. 

c{q) = c(0) + c 2 q 2 + c 4 q i + ... . 



(6.229) 
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Here we have assumed spatial isotropy. Then 



1 — n c(q) = — ^— = 1 - n c(0) - n c 2 q 2 + . . . 

S(q) (6.230) 

= C 2 R 2 + q 2 R 2 + 0(q 4 ) , 



where 



oo 



R 2 = -nc 2 = 2im J dr r 4 c(r) (6.231) 
o 

and 

. 2 1 - n c(0) 1 - 4tto f°°dr r 2 c(r) 

r 2 = = -= f ~° 4 , ' ■ (6.232) 

/t z 27rn J Q ar r 4 c(rj 

The quantity R(T) tells us something about the effective range of the interactions, while £(T) is the correlation 
length. As we approach a critical point, the correlation length diverges as a power law: 

£(T) ~ A\T - T c \~ v . (6.233) 

The susceptibility is given by 



nj3R' 
^- 2 Tq^+0{q 4 ) 

In the Ornstein-Zernike approximation, one drops the 0(q A ) terms in the denominator and retains only the long 
wavelength behavior, in the direct correlation function. Thus, 

n(3R- 2 
I^Tq 2 



%) = -nPS(q) = - e _ 2 * Mn( ., (6-234) 



x (*?) = r 2 _i 2 ■ (6.235) 



We now apply the inverse Fourier transform back to real space to obtain X oz (r). In d = 1 dimension the result can 
be obtained exactly: 



„, , , n f dq e 1 



k B TR 2 J 2tt £" 2 + q 2 



(6.236) 



. e -N/£ 



2fc B Ti? 2 

In higher dimensions d > 1 we can obtain the result asymptotically in two limits: 
• Take r — > 00 with £ fixed. Then 

*"<-> - - c *» ■ W ■ ^ ■ I 1 + °(— ) } • (6 - 237) 

where the C d are dimensionless constants. 

Take £ — ► 00 with r fixed; this is the limit T — ► T c at fixed r. In dimensions d > 2 we obtain 



In cZ = 2 dimensions we obtain 



where the C' d are dimensionless constants. 
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At criticality, £ — ► oo, and clearly our results in d = 1 and d = 2 dimensions are nonsensical, as they are divergent. 
To correct this behavior, M. E. Fisher in 1963 suggested that the OZ correlation functions in the r <C £ limit be 
replaced by 

x u*-^ n -i$*-££i> (6 - 240) 

a result known as anomalous scaling. Here, r] is the anomalous scaling exponent. 

Recall that the isothermal compressibility is given by k t = — X(0). Near criticality, the integral in X(0) is dominated 
by the r <C £ part, since £ — * oo. Thus, using Fisher's anomalous scaling, 

ti T = -X(0) = - fd*rX(r) 



_ , (6.241) 
„ A [d*r X^r ~ B£ 2 -" ~ C IT- T c r (2 -" )y , 

where A, _B, and C are temperature-dependent constants which are nonsingular at T = T c . Thus, since k t oc 
|r-T c |-T,we conclude 

7 = (2 - n) v , (6.242) 

a result known as hyper scaling. 



6.6 Coulomb Systems : Plasmas and the Electron Gas 



6.6.1 Electrostatic potential 

Coulomb systems are particularly interesting in statistical mechanics because of their long-ranged forces, which 
result in the phenomenon of screening. Long-ranged forces wreak havoc with the Mayer cluster expansion, since 
the Mayer function is no longer integrable. Thus, the virial expansion fails, and new techniques need to be applied 
to reveal the physics of plasmas. 

The potential energy of a Coulomb system is 

U =\jd d rj dV p{r) u(r- r 1 ) p(r') , (6.243) 

where p(r) is the charge density and u(r), which has the dimensions of (energy) /(charge) 2 , satisfies 

V 2 u(r - r') = -4tt 5{r - r') . (6.244) 

Thus, 

-2tt\x -x'\ , d = 1 



u(r) = < -2 ln|r - r'\ , d = 2 



, d= 3 



(6.245) 



For discete particles, the charge density p(r) is given by 



(6.246) 
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where q i is the charge of the i th particle. We will assume two types of charges: q = ±e, with e > 0. The electric 
potential is 



(6.247) 



(r) = I dV u(r - r') p(r') 



This satisfies the Poisson equation, 

V 2 0(r) = -47rp(r) . (6.248) 

The total potential energy can be written as 

U =\Jd\(t){r)p{r) (6.249) 
= ±]T^K;), (6.250) 

i 

6.6.2 Debye-Huckel theory 

We now write the grand partition function: 



= -\ yd'Vp av (r)0 av (r) + J dff av (r) p{r)+ \ J d d r 8p{r) 5(j>{r) 
We apply the mean field approximation in each region of space, which leads to 

f2(T,V,ii + ,n_) = -KT\- + d z + Jd\ exp(- e -^P^j 

- k B T\Z d z_Jd d r exp( 

where 



e0 av (r) 
KT 



(6.251) 



NJ N_ 
■fd d r 1 ...fd% ++N _e- mr --^- ) . 

We now adopt a mean field approach, known as Debye-Htickel theory, writing 

p(r) = p av (r) + Sp(r) (6.252) 

4>{r) = av (r) +5</>(r) . (6.253) 

We then have 

U=\ Jd d r [p av (r) + Sp(r)] • [0 av (r) + <ty(r)] 

= U Q ignore fluctuation term (6.254) 



(6.255) 
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The charge density is therefore 

*) - j&y - V* ■»(- ^) - '-$) ■ (6 - 257) 

where we have now dropped the superscript on av (r) for convenience. At r — > oo, we assume charge neutrality 
and 0(oo) = 0. Thus 

A+ d z + = n + (00) = Al d z_ = n_ (00) = , (6.258) 
where is the ionic density of either species at infinity. Therefore, 

P (r) =-2e noo sinhf^M) . (6.259) 

We now invoke Poisson's equation, 

V 2 = Siren^ sinh(/3e</>) - 47rp ext , (6.260) 
where p ext is an externally imposed charge density. 
If ecf) <C fc B ^' we can expand the sinh function and obtain 

V 2 = K 2 D - 4^p cxt , (6.261) 

where 

1*™^" A D =r^V' 2 . (6-262) 



The quantity A D is known as the Debye screening length. Consider, for example, a point charge Q located at the 
origin. We then solve Poisson's equation in the weak field limit, 

V 2 = K 2 D 4>-ATiQ5{r) . (6.263) 

Fourier transforming, we obtain 

-g 2 0(g) = >4 0(g) -4ttQ $( q ) = -i^ . (6.264) 

g + «d 

Transforming back to real space, we obtain, in three dimensions, the Yukawa potential, 

^ = iR7T4 = -' e (6 - 265) 

This solution must break down sufficiently close to r = 0, since the assumption e0(r) -c k B T is no longer valid 
there. However, for larger r, the Yukawa form is increasingly accurate. 

For another example, consider an electrolyte held between two conducting plates, one at potential <f>{x = 0) = 
and the other at potential <j>(x = L) = V, where x is normal to the plane of the plates. Again assuming a weak 
field ecfi <C k B T, we solve V 2 </) = k 2 </> and obtain 

<()(x) =Ae K ° x +Be~ K ° x , (6.266) 

We fix the constants A and B by invoking the boundary conditions, which results in 

sinh(K D x) 

= V ' ^T7 — FT ' (6.267) 

Debye-Hiickel theory is valid provided X 3 » 1, so that the statistical assumption of many charges in a screen- 
ing volume is justified. 
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6.6.3 The electron gas : Thomas-Fermi screening 



Assuming k s T <C e F , thermal fluctuations are unimportant and we may assume T = 0. In the same spirit as the 
Debye-Huckel approach, we assume a slowly varying mean electrostatic potential <p(r). Locally we can write 



2 7,2 



e F = — — - e<p{r) . 
2m 

Thus, the Fermi wavevector fc F is spatially varying, according to the relation 



k F (r) 



2to ( i/ \ 



1/2 



The local electron number density is 



, . fcf(r) / eS(r) 

< r ) = = r V< 1 



3/2 



(6.268) 



(6.269) 



(6.270) 



In the presence of a uniform compensating positive background charge p + = en OT , Poisson's equation takes the 
form 



V <b = 4ire n n 



1 



e<j)(r) 



3/2 



1 



47r PextO) • 



If ecf> <C e F , we may expand in powers of the ratio, obtaining 



cf) = K TF 0-47rp ext (r) 



(6.271) 



(6.272) 



Here, k tv is the Thomas-Fermi wavevector, 



St, 



2\l/2 



(6.273) 



Thomas-Fermi theory is valid provided A^, F ^> 1, where A TF = , so that the statistical assumption of many 
electrons in a screening volume is justified. 

One important application of Thomas-Fermi screening is to the theory of metals. In a metal, the outer, valence 
electrons of each atom are stripped away from the positively charged ionic core and enter into itinerant, plane- 
wave-like states. These states disperse with some e(k) function (that is periodic in the Brillouin zone, i.e. under 
k — > k + G, where G is a reciprocal lattice vector), and at T = this energy band is filled up to the Fermi level e F , 
as Fermi statistics dictates. (In some cases, there may be several bands at the Fermi level, as we saw in the case 
of yttrium.) The set of ionic cores then acts as a neutralizing positive background. In a perfect crystal, the ionic 
cores are distributed periodically, and the positive background is approximately uniform. A charged impurity in 
a metal, such as a zinc atom in a copper matrix, has a different nuclear charge and a different valency than the 
host. The charge of the ionic core, when valence electrons are stripped away, differs from that of the host ions, 
and therefore the impurity acts as a local charge impurity. For example, copper has an electronic configuration of 
[Ar] 3d 10 4s 1 . The 4s electron forms an energy band which contains the Fermi surface. Zinc has a configuration of 
[Ar] 3c? 10 4s 2 , and in a Cu matrix the Zn gives up its two 4s electrons into the 4s conduction band, leaving behind 
a charge +2 ionic core. The Cu cores have charge +1 since each copper atom contributed only one 4s electron to 
the conduction band. The conduction band electrons neutralize the uniform positive background of the Cu ion 
cores. What is left is an extra Q = +e nuclear charge at the Zn site, and one extra 4s conduction band electron. 
The Q = +e impurity is, however, screened by the electrons, and at distances greater than an atomic radius the 
potential that a given electron sees due to the Zn core is of the Yukawa form, 



W = 7 



(6.274) 
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We should take care, however, that the dispersion e(k) for the conduction band in a metal is not necessarily of the 
free electron form e(k) = h 2 k 2 /2m. To linear order in the potential, however, the change in the local electronic 
density is 

5n(r) = ecf>(r) g(e F ) , (6.275) 
where g(e F ) is the density of states at the Fermi energy. Thus, in a metal, we should write 

V 2 = (-4<7r)(-e5n) 

2 , s 2 ( 6 - 276 ) 

= 47re g[e F ) <j> = k tf 4> , 

where 

k tf = ^ 47re 2 g(e F ) . (6.277) 

The value of g(e F ) will depend on the form of the dispersion. For ballistic bands with an effective mass m*, the 
formula in eqn. 6.272 still applies. 



The Thomas-Fermi atom 



Consider an ion formed of a nucleus of charge +Ze and an electron cloud of charge —Ne. The net ionic charge is 
then (Z — N)e. Since we will be interested in atomic scales, we can no longer assume a weak field limit and we 
must retain the full nonlinear screening theory, for which 

(2777 s ) 3 / 2 / X 3 / 2 

V 2 (Xr) = 47re • ^ — 2_( e + e 0(r)l -^Ze5(r). (6.278) 

We assume an isotropic solution. It is then convenient to define 

Ze 2 

e F + e0(r) = X(r/r ), (6.279) 

r 

where ?* is yet to be determined. As r — > we expect X — > 1 since the nuclear charge is then unscreened. We then 
have 

V 2 f— • X(r/r )\ = \^ x "( r /r ) , (6.280) 



r 



thus we arrive at the Thomas-Fermi equation, 



X"(t) = -^X 3/2 (t) , (6.281) 



with r = tr Q , provided we take 

fc2 / o \2/3 

r °=wU7l) -°- 885 ^ 1/3 « BJ (6-282) 
where a B = = 0.529 A is the Bohr radius. The TF equation is subject to the following boundary conditions: 



• At short distances, the nucleus is unscreened, i.e. 

X(0) = 1 



(6.283) 
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Figure 6.19: The Thomas-Fermi atom consists of a nuclear charge +Ze surrounded by N electrons distributed in a 
cloud. The electric potential <j>(r) felt by any electron at position r is screened by the electrons within this radius, 
resulting in a self -consistent potential <f>(r) = <f>o + (Ze 2 jr) X(r/ro). 



For positive ions, with N < Z, there is perfect screening at the ionic boundary R = t* r , where X(t*) = 0. 
This requires 

Ze 2 N . Ze 2 „.,,„, J „ (Z-N)e 



This requires 



—^X(R/r ) + —X'(R/r Q ) 
R z Rr n 



-t* X '(t*) = l--. 



r = 



R 2 



r . 



(6.284) 



(6.285) 



For an atom, with N = Z, the asymptotic solution to the TF equation is a power law, and by inspection is found 
to be X(t) ~ C t~~ 3 , where C is a constant. The constant follows from the TF equation, which yields 12 C = C 3 / 2 , 
hence C = 144. Thus, a neutral TF atom has a density with a power law tail, with p ~ r -9 / 2 . TF ions with N > Z 
are unstable. 



6.7 Polymers 

6.7.1 Basic concepts 

Linear chain polymers are repeating structures with the chemical formula (A) x , where A is the formula unit and 
x is the degree of polymerization. In many cases (e.g. polystyrene), x > 10 5 is not uncommon. For a very readable 
introduction to the subject, see P. G. de Gennes, Scaling Concepts in Polymer Physics. 

Quite often a given polymer solution will contain a distribution of x values; this is known as polydispersity. Various 
preparation techniques, such as chromatography, can mitigate the degree of polydispersity. Another morpholog- 
ical feature of polymers is branching (see fig. 6.21), in which the polymers do not form linear chains. 

Polymers exhibit a static flexibility which can be understood as follows. Consider a long chain hydrocarbon with a 
— C — C — C— backbone. The angle between successive C — C bonds is fixed at 9 « 68°, but the azimuthal angle 
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H h H H 
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polyethylene (GH 2 ) X polyacetylene (CH)^ polyoxyethylene (021140)3, 
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I 
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1 

H 



H 
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t H 

polystyrene (CgHg)^ poly(methyl methacrylate) (CgHgO^ 

Figure 6.20: Some examples of linear chain polymers. 

(f can take one of three possible low-energy values, as shown in the right panel of fig. 6.22. Thus, the relative 
probabilities of gauche and trans orientations are 

Prob (gauche) =2e _ AE/fcBT; 
Prob (trans) 

where Ae is the energy difference between trans and gauche configurations. This means that the polymer chain is 
in fact a random coil with a persistence length 

£ p = £ Q e A£ / k * T , (6.287) 

where £ is a microscopic length scale, roughly given by the length of a formula unit, which is approximately a 
few Angstroms (see fig. 6.23). Let L be the total length of the polymer when it is stretched into a straight line. If 
£ p > L, the polymer is rigid. If £ p <C L, the polymer is rigid on the length scale £ p but flexible on longer scales. We 
have 

£ JL = 1 e ^/k B T (6 28g) 

L N 

where we now use N (rather than x) for the degree of polymerization. 

In the time domain, the polymer exhibits a dynamical flexibility on scales longer than a persistence time. The persis- 
tence time r p is the time required for a trans-gauche transition. The rate for such transitions is set by the energy 
barrier B separating trans from gauche configurations: 

r p = r e B ' k » T , (6.289) 

where r ~ 10~ n s. On frequency scales w <C r" 1 the polymer is dynamically flexible. If Ae ~ k B T <C B the 
polymer is flexible from a static point of view, but dynamically rigid. That is, there are many gauche orientations 
of successive carbon bonds which reflect a quenched disorder. The polymer then forms a frozen random coil, like 
a twisted coat hanger. 
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Figure 6.21: Branched polymers. 



6.7.2 Polymers as random walks 



A polymer can be modeled by a self-avoiding random walk (SAW). That is, on scales longer than £ p , it twists about 
randomly in space subject to the constraint that it doesn't overlap itself. Before we consider the mathematics of 
SAWs, let's first recall some aspects of ordinary random walks which are not self-avoiding. 

We'll simplify matters further by considering random walks on a hypercubic lattice of dimension d. Such a lattice 
has coordination number 2d, i.e. there are 2d nearest neighbor separations, 



±a e 



d i 



(6.290) 



where a is the lattice spacing. Consider now a random walk of N steps starting at the origin. After N steps the 
position is 



R 



N 



(6.291) 



where Sj takes on one of 2d possible values. Now N is no longer the degree of polymerization, but more like L / I , 
which is the number of persistence lengths in the chain. We assume each step is independent, hence (8j ■ <$ ■/) = 
a 2 <$ ■ ■, and (R 2 N ) = Na 2 . The full distribution P N {R) is given by 



P N (R) = (2d)- N Y,---/Z S R^ J S J 



a 



7r / a 

i f 
J 2ir 

7T / a 



7r / a 



d g—ik-R 



-ir/a 



i d r 

- 2J cosik^a) 



d 

L iu=l 



(2tt)° 



AT In 1 



1 



2„2 



— I ld d ke- Nk2a ^ 2d e- tkR = 



■k z a 
2d 

( d 
\27riV 



(6.292) 



-dR 2 /2Na 2 



This is a simple Gaussian, with width (-R 2 ) = d- (Na 2 /d) — Na 2 , as we have already computed. We define 



Rn 



N a as the radius of gyration. 



The total number of random walk configurations with end-to-end vector R is then (2d) P N (R), so the entropy of 
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a chain at fixed elongation is 



S(R, N) = k B In (2d)"P N (R) = 5(0, N) 



dk m R 2 



JVV"/j-"v-."y 2Na 2 
If we assume that the energy of the chain is conformation independent, then E = E (N) and 

F(R,N)=F(0,N) + ^^. 

In the presence of an external force F oxt , the Gibbs free energy is the Legendre transform 

G(F cxt ,N)=F(R 7 N)-F cxt -R, 

and dG/dR = then gives the relation 

No 2 



(R(F cxt ,N)) 



dKT ext 



This may be considered an equation of state for the polymer. 



(6.293) 

(6.294) 
(6.295) 

(6.296) 



Following de Gennes, consider a chain with charges ±e at each end, placed in an external electric field of magni- 
tude E = 30,000V/cm. Let N = 10 4 , a = 2 A, and d = 3. What is the elongation? From the above formula, we 
have 



eER 
3k a T 



0. 



(6.297) 



Structure factor 



We can also compute the structure factor (see eqn. 6.190), 

N N 

S («)4(SE«* M )' (6 - 298) 

m— 1 n— 1 
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Figure 6.23: The polymer chain as a random coil. 



where 



For averages with respect to a Gaussian distribution, 

( e «K*»-**)) = cxp [ _ . {Rm R n) f) . 

We then have 

i N 

S ^ = 7j H e"l m -"l g2a2 / 2 = Nf(Nq 2 a 2 /2) , 



m.n—1 



/M-£(.--i + .)-i-f + £ 



(6.299) 



(6.300) 



(6.301) 



Self-avoiding walks 

On a lattice of coordination number z, the number of TV-step random walks starting from the origin is Q N = z N . 
If we constrain our random walks to be self -avoiding, the number is reduced to 

fi$" = CN^ 1 y N , (6.302) 

where C and 7 are dimension-dependent constants, and we expect y < z — 1, since at the very least a SAW cannot 
immediately double back on itself. In fact, on the cubic lattice one has z = 6 but y = 4.68, slightly less than z — 1. 
One finds 7 d=2 — § and 7 d=3 — g. The RMS end-to-end distance of the SAW is 



R F = aN" 



(6.303) 



where a and v are rf-dependent constants, with v d=x = 1, ^ d=2 ~ |, and ^ rf=3 ~ |. The distribution P N (R) has a 
scaling form, 



1 „/ 



-^F V^F 



One finds 



(a < i? < ATa) 



x < 1 



exp(— cc" 5 ) a; » 1 , 



(6.304) 



(6.305) 



with g = (7 — l)/v and 5 = 1/(1 — v). 
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6.7.3 Flory theory 

What is missing from the random walk free energy is the effect of steric interactions. Suppose we have a chain of 
radius R. Then the average monomer density within the chain is c = N/R d . Assuming short-ranged interactions, 
we should then add a term to the free energy which effectively counts the number of near self -intersections of the 
chain. This number should be roughly Nc. Thus, we write 

F(R,N) = F + u(T)^ + ldk B T^. (6.306) 

The effective interaction u(T) is positive in the case of a so-called 'good solvent'. 
The free energy is minimized when 



which yields the result 



9F dvN J- Ah T ^ (f> *™ 



R F (N)= (^) +2 N^ 



Thus, we obtain u = 3/(d + 2), and ^ d=3 = |, which is extremely close to the correct value v = 0.5880 obtained 
from numerical work and renormalization group calculations. 

For a SAW under an external force, we compute the Gibbs partition function, 

Y(F eKt ,N) = J d d R P N (R) e F ^- R ' k * T = J d d x f(x) e sh x , (6.309) 

where x = R/R F and s = k B T/R F F cxt and n = F ext . One than has R(F ext ) = i? F <£(i? F /£), where £ = k B T/F ext 
and R(F cxt ) = F cxt i?|//c B T. For small values of its argument one has <P(u) cx u. For large u it can be shown that 

R(F cxt ) cx (F GXtJ R F /fc B T) 2 / 3 . 



6.7.4 Polymers and solvents 

Consider a solution of monodisperse polymers of length N in a solvent. Let <fi be the dimensionless monomer 
concentration, so <fi/N is the dimensionless polymer concentration and <fi s = 1 — (f> is the dimensionless solvent 
concentration. (Dimensionless concentrations are obtained by dividing the corresponding dimensionful concen- 
tration by the overall density.) The entropy of mixing for such a system is given by eqn. 2.352. We have 

Vk ( 1 1 
S mix = — -a. |-01n0+(l-0)ln(l-0)j , (6.310) 

where v cx a 3 is the volume per monomer. Accounting for an interaction between the monomer and the solvent, 
we have that the free energy of mixing is 

= 1 01n0 + (1 - 0) ln(l - </>) + Xftl ~ 4>) ■ (6.311) 
V k B l N 

where X is the dimensionless polymer-solvent interaction, called the Flory parameter. This provides a mean field 
theory of the polymer-solvent system. 



The osmotic pressure 77 is defined by 

OF ■ 

jj mix 



dV 



(6.312) 

N„ 
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which is the variation of the free energy of mixing with respect to volume holding the number of polymers constant. 
The monomer concentration is <f> = NN p v /V, so 



d_ 

dV 



d 



No NN p v 



(6.313) 



Now we have 



and therefore 



F 1Dix = NN p k B T{ — In, 



n 



1) ln(l - 4>) + X (1 - 



(N- 1 - 1) - ln(l - 0) -X< 



(6.314) 



(6.315) 



In the limit of vanishing monomer concentration i 



n 



0, we recover 

d>k„T 



(6.316) 



which is the ideal gas law for polymers. 

For iV -1 <C 4> <C 1, we expand the logarithm and obtain 



y n 

KT 



-0+i(l-2X)0 2 
i(l-2X) 2 . 



0( 



(6.317) 



Note that 77 > only if X < \, which is the condition for a 'good solvent'. 

In fact, eqn. 6.317 is only qualitatively correct. In the limit where X ^ \, Flory showed that the individual polymer 
coils behave much as hard spheres of radius R F . The osmotic pressure then satisfies something analogous to a 
virial equation of state: 



n 



Nv, 



A 



Nv r 



Nv, 



o 



(6.318) 



This is generalized to a scaling form in the second line, where h(x) is a scaling function, and (jf = Nv /Rp cx 



N 4 / 5 , assuming d = 3 and v = | from Flory theory. As x 



0, we must recover the ideal gas law, so 



h(x) = l+0(x) in this limit. For x — > oo, we require that the result be independent of the degree of polymerization 
N. This means h(x) cx x p with |p = 1, i.e. p = 4. The result is known as the des Cloiseaux law: 



V7 



C^ 4 



(6.319) 



where C is a constant. This is valid for what is known as semi-dilute solutions, where <jf <C 4> *C 1. In the dense 
limit (f> ~ 1, the results do not exhibit this universality, and we must appeal to the theory of liquids as discussed 
in §6.5. 
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6.8 Appendix : Potts Model in One Dimension 



6.8.1 Definition 



The Potts model is defined by the Hamiltonian 

Here, the spin variables a i take values in the set {1, 2, . . . , q} on each site. The equivalent of an external magnetic 
field in the Ising case is a field h which prefers a particular value of a (a = 1 in the above Hamiltonian). Once 
again, it is not possible to compute the partition function on general lattices, however in one dimension we may 
once again find Z using the transfer matrix method. 



6.8.2 Transfer matrix 



On a ring of N sites, we have 

Z = Tre-? H 

= E e 

= Tr (R N ) 

where the q x q transfer matrix R is given by 



R i = e pj8 a „, e iPh8*,i e \PHS at>l = < 



In matrix form, 



e 0h/2 e 0J 



R 



oPh/2 



e 0h/2 



' e P(J+h) 


if a 


e PJ 


if a 


« e Ph/2 


if a 


e Ph/2 


if a 


1 


if a 


V 




e /3h/2 




1 




e /3J 





1 
1 



of J 1 

1 e pj 



The matrix R has q eigenvalues A J7 with j = 1, . . . , q. The partition function for the Potts chain is then 



3=1 



We can actually find the eigenvalues of R analytically. To this end, consider the vectors 

, V=(<7-l + e^r 1/2 







1 



V 1 / 



(6.321) 



(6.322) 



(6.323) 



(6.324) 



(6.325) 
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Then R may be written as 

R = (e 13 - 1 - 1) I + (q - 1 + e? h ) \ i/> ) ( xj> \ + (e? J - l) (e f3h - l) | $ )( <$> \ , (6.326) 
where I is the 5x9 identity matrix. When h = 0, we have a simpler form, 

R= (e' 3J - + ■ (6-327) 

From this we can read off the eigenvalues: 

A x = e' 37 + (6.328) 
Xj=e^ J -l , ie{2,...,g}, (6.329) 

since \ip) is an eigenvector with eigenvalue A = e /3J + g — 1, and any vector orthogonal to | if) ) has eigenvalue 
A = e 3,7 — 1. The partition function is then 

Z = (e? J + q-l) N + ( q - l)(e? J - 1)" . (6.330) 

In the thermodynamic limit N — ► 00 , only the A x eigenvalue contributes, and we have 

F(T, N,h = 0) = -Nk B Tln ( e J/fc B T + q - l) f or N -> 00 . (6.331) 

When ft is nonzero, the calculation becomes somewhat more tedious, but still relatively easy. The problem is that 
I t/j ) and | <fr ) are not orthogonal, so we define 

where 

/ e Ph \l/2 

xs M*> = [ q -l + ef>k ) ■ (6333) 
Now we have ( X | ip ) = 0, with ( X | X ) = 1 and ( V | V> ) = h wi th 



|0) = v 7 ! -s 2 \x)+x\i>) ■ (6-334) 

and the transfer matrix is then 

R = (e 0J - l) I + (g- - 1 + e 1311 ) \ ip){if>\ 



+ ( e ^_l)( e ^-i) 



(l-x 2 )\X)(X\+x 2 \4>)(i;\+xVl~x 2 (\X}(il>\ + \il>){X\ 



e 0J - 1 



q-l + e f3h ) + (eP J -l)(e/ 3h -l 
+ (^-l)(^-l) (^ n ^)l^)(^l 



g - 1 + C 3 ' 1 



(6-335) 



+ (e" - 1) (e* - 1) (^YT^) 1/2 (l^l + I^X 
which in the two-dimensional subspace spanned by | X ) and | ip ) is of the form 



•eft)" 
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Recall that for any 2x2 Hermitian matrix, 



M = a n I + a ■ t 



a o ' a 3 a i ~ * a ; 
a, + ia 9 a n — a. 



the characteristic polynomial is 



P(A) = dot (A I - AI) = (A - a ) 5 



a, — a 9 — a 



3 ' 



and hence the eigenvalues are 



A ± = a ± V fl l + a 2 + «3 



For the transfer matrix of eqn. 6.335, we obtain, after a little work, 



eJ 3J - 1 



± 



q-l + e P h +(eP J -l)(e^-l) 



(6.337) 

(6.338) 
(6.339) 

(6.340) 



g - 1 + eP h + (eP J - l) (e^ - l) - 4(g - 1) (eJ 3J - l) (e/ 3 ' 1 - l) 



There are q — 2 other eigenvalues, however, associated with the (</— 2) -dimensional subspace orthogonal to | X) 
and Clearly all these eigenvalues are given by 



A, = e! iJ - 1 



3 e {3 , . . . , q) 



(6.341) 



The partition function is then 



Z = Af + Af + (q - 2) A^ , (6.342) 



and in the thermodynamic limit — > oo the maximum eigenvalue A x dominates. Note that we recover the correct 
limit as h — > 0. 



Chapter 7 

Mean Field Theory of Phase Transitions 
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7.2 The van der Waals system 

7.2.1 Equation of state 

Recall the van der Waals equation of state, 



p+±y v -b) = RT, (7.1) 

where v = N A V/N is the molar volume. Solving for p(v, T), we have 

RT a 

P = 7 - - • (7-2) 

v — o v z 

Let us fix the temperature T and examine the function p(v). Clearly p(v) is a decreasing function of volume for 
v just above the minimum allowed value v = b, as well as for v — > oo. But is p(v) a monotonic function for all 

v € [b, oo]? 

We can answer this by computing the derivative, 

dp\ 2a RT 



dv Jrp v 3 {v — b) 2 

Setting this expression to zero for finite v, we obtain the equation 1 

2a u 3 
bRT ~ {u - l) 2 ' 



(7.3) 



(7.4) 



where u = v/b\s dimensionless. It is easy to see that the function /(it) = u 3 /(u — l) 2 has a unique minimum 
for u > 1. Setting f(u*) = yields u* = 3, and so f min = /(3) = f . Thus, for T > T c = 8a/27bR, the LHS of 
eqn. 7.4 lies below the minimum value of the RHS, and there is no solution. This means that p(v, T > T c ) is a 
monotonically decreasing function of v. 

At T = T c there is a saddle-node bifurcation. Setting v c = bu* = 3b and evaluating p c = p(v c , T c ), we have that the 
location of the critical point for the van der Waals system is 

n 8a 

«c-36 , T c = — — . (7.5) 



^ 27b2 , c , 27bR 

For T < T c , there are two solutions to eqn. 7.4, corresponding to a local minimum and a local maximum of the 
function p(v). The locus of points in the (v,p) plane for which (dp/dv) T = is obtained by setting eqn. 7.3 to zero 
and solving for T, then substituting this into eqn. 7.2. The result is 

*/ \ a 2a6 

P* ("= — - — ■ ( 7 -6) 

v v 

Expressed in terms of dimensionless quantities p = p/p c and v = v/v c , this equation becomes 

*?*(«) = 4-4- (7.7) 
v v 



1 There is always a solution to (dp/dv) T = at v = oo. 
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Along the curve p = p*(v), the isothermal compressibility, n T = — ^(^) T diverges, heralding a thermodynamic 
instability. To understand better, let us compute the free energy of the van der Waals system, F = E — TS. 
Regarding the energy E, we showed back in chapter 2 that 

=t(£\ -p=4, (7-8) 



dvJ T \dT J. 



'v 



v 



which entails 

e{T,v) = \fRT-- , (7.9) 

where e = E/v is the molar internal energy. The first term is the molar energy of an ideal gas, where / is 
the number of molecular freedoms, which is the appropriate low density limit. The molar specific heat is then 

C V — \dTJv — 2- 



— ^ — t;R, which means that the molar entropy is 



s(T,v) = J T dT' C ^ = tRHT/T c ) + Sl ( 



(7.10) 



We then write / = e — Ts, and we fix the function s 1 (v) by demanding that p = — (f^) T - This yields s x («) 
Rln(v — b) + s , where s is a constant. Thus 2 , 



f(T,v) = {RT (l - \n(T/T c ) ) - - - RTln{v - b) - Ts Q . (7.11) 



We know that under equilibrium conditions, / is driven to a minimum by spontaneous processes. Now suppose 

that f^r | T < over some range of v at a given temperature T. This would mean that one mole of the system at 
volume v and temperature T could lower its energy by rearranging into two half-moles, with respective molar 
volumes v ± Sv, each at temperature T. The total volume and temperature thus remain fixed, but the free energy 

changes by an amount Af = i | T (i5u) 2 < 0. This means that the system is unstable - it can lower its energy by 



2 Don't confuse the molar free energy (/) with the number of molecular degrees of freedom (/)! 
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gas 


a 

V mol- / 


b (A) 

\ mol / 


p c (bar) 


T c (K) 


v c (L/mol) 


Acetone 


14.09 


0.0994 


52.82 


505.1 


0.2982 


Argon 


1.363 


0.03219 


48.72 


150.9 


0.0966 


Carbon dioxide 


3.640 


0.04267 


7404 


304.0 


0.1280 


hthanol 


12.18 


0.08407 


63.83 


516.3 


0.2522 


Freon 


10.78 


0.0998 


40.09 


384.9 


0.2994 


Helium 


0.03457 


0.0237 


2.279 


5.198 


0.0711 


Hydrogen 


0.2476 


0.02661 


12.95 


33.16 


0.0798 


Mercury 


8.200 


0.01696 


1055 


1723 


0.0509 


Methane 


2.283 


0.04278 


46.20 


190.2 


0.1283 


Nitrogen 


1.408 


0.03913 


34.06 


128.2 


0.1174 


Oxygen 


1.378 


0.03183 


50.37 


154.3 


0.0955 


Water 


5.536 


0.03049 


220.6 


647.0 


0.0915 



Table 7.1: Van der Waals parameters for some common gases. (Source: Wikipedia) 



dividing up into two subsystems each with different densities (i.e. molar volumes). Note that the onset of stability 
occurs when 



dv 2 



dp 
dv 



1 



0, 



(7.12) 



which is to say when n p = oo. As we saw, this occurs at p = p* (v), given in eqn. 7.6. 



f) 2 f i 

However, this condition, |^ | < 0, is in fact too strong. That is, the system can be unstable even at molar volumes 

where §^\ T > 0. The reason is shown graphically in fig. 7.2. At the fixed temperature T, for any molar volume v 
between «i iquid = v 1 and v gas = v 2 , the system can lower its free energy by phase separating into regions of different 
molar volumes. In general we can write 

v = (1 — x) v 1 + xv 2 , (7.13) 
so v = v 1 when x — and v = v 2 when x = 1. The free energy upon phase separation is simply 

f=(l-x)f 1 +xf 2 , (7.14) 

where /• = f(v-,T). This function is given by the straight black line connecting the points at volumes v 1 and v 2 
in fig. 7.2. 



The two equations which give us v 1 and v 2 are 



dv 



df 
dv 



and 



/(7>i) - /( T ^' 2 ) = ( v 2 - v i) |^ 



In terms of the pressure, p = — 1£ \ T , these equations are equivalent to 



(7.15) 



(7.16) 



(7.17) 



dvp(T, v) = {y 2 - Vj) p{T, v x ) 



(7.18) 
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2 3 
volume v/v c 

Figure 7.2: Molar free energy f(T, v) of the van der Waals system T = 0.85 T c , with dot-dashed black line showing 
Maxwell construction connecting molar volumes V1.2 on opposite sides of the coexistence curve. 

This procedure is known as the Maxwell construction, and is depicted graphically in Fig. 7.3. When the Maxwell 
construction is enforced, the isotherms resemble the curves in Fig. 7.4. In this figure, all points within the purple 
shaded region have ^ < 0, hence this region is unstable to infinitesimal fluctuations. The boundary of this 
region is called the spinodal, and the spontaneous phase separation into two phases is a process known as spinodal 
decomposition. The dot-dashed orange curve, called the coexistence curve, marks the instability boundary for nucle- 
ation. In a nucleation process, an energy barrier must be overcome in order to achieve the lower free energy state. 
There is no energy barrier for spinodal decomposition - it is a spontaneous process. 



7.2.2 Analytic form of the coexistence curve near the critical point 

We write v L = v c + w L and v G = v c + w G . One of our equations is p(v c + w L ,T) = p(v c + w G ,T). Taylor expanding 
in powers of w L and w G , we have 



0=P„(v c ,T) (w G - w L ) + \p vv (v cl T) {w% - wl) + \p vm ,{v c ,T) {w% - w[) + 



where 



dp 
dv 



d 2 p 
dv 2 



d 3 p 
dv 3 ' 



d 2 p 
dvdT 



etc. 



The second equation we write as 

fdwp{v c + w,T) = \{w G - w L )(p(v c + w L ,T) +p(v c + w G ,T)j . 

Expanding in powers of w L and w G , this becomes 

p(v c ,T) (w G - w L ) + \p v {v c ,T) (w% ~ wl) + \p vv {v c ,T) (w% - t^) 

+ mPwv( v c T ) ( w g - w t) + mPww( v c T ) (wl- w t) +■■■ 

= \{w G ~ w L )^2p{v c ,T) + p v {v c ,T) (w G + w L ) + \p vv {v c ,T) (wl + wl) 

+ \Pvvv( V V T ) ( w l + w l) + j4Pvvvv( v ci T ) K+ w l)+--'} 



(7.19) 
(7.20) 

(7.21) 

(7.22) 
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V, 



2 

V 



Figure 7.3: Maxwell construction in the (v,p) plane. The system is absolutely unstable between volumes Vd and 
v e . For v G [v a , Vd] of v G [u c , u c ], the solution is unstable with respect to phase separation. Source: Wikipedia. 



Subtracting the LHS from the RHS, we find that we can then divide by | (w G — wf), resulting in 

=p vv (v c ,T) + \p vvv (v c ,T) (w G +w L ) + ^p vvvv (v c ,T) (3wl + Aw G w L + 3wl) + £>(u4i_) 

We now define w± = w G ± w L . In terms of these variables, eqns. 7.19 and 7.23 become 

= p v (v c , T) + y vv (v c , T) w + + \p vvv {v c , T)(wl + \w 2 _)+0{wl) 
= P vv (v c ,T) + \ p mv {v c ,T)w + + \p vvvv {v c ,T) (w\ + \w 2 _) +0(w 3 ±) . 



(7.23) 



(7.24) 
(7.25) 



We now evaluate w ± toorderT. Note that p v (v c , T c ) = p vv (v c ,T c ) = 0, since the critical point is an inflection point 
in the (v,p) plane. Thus, we have p v (v cl T) = p vT + O{0 2 ), where T = T c + and p vT = p vT {v c , T c ). We can 
then see that w_ is of leading order \/ — , while w + is of leading order 0. This allows us to write 



Thus, 



We then have 



= P vT ® + jiPv 



O{0 2 



= P vvT & + \ p vvv w + + ^ p vvvv w 2 _ +O(0 2 ) 



w_ 



Pvvv 



1/2 



-0 + ... 

IQPvvv PvvT 



5p 



2 



9 



6 PvT 

Pvvv 



1/2 



'-0 



6 Pvl 



1/2 



V^0 + 



tp vT P 

r J + O(0 3 ^ 2 ) 

<9 + e>(<9 3/2 ) . 



^ Pvvv 

3 P V T Pvvvv ~ ^ Pvvv PvvT 



^ Pvvv 



(7.26) 
(7.27) 



(7.28) 
(7.29) 

(7.30) 
(7.31) 
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0.5 1 1.5 2 2.5 3 



molar volume v/v c 

Figure 7.4: Pressure-volume isotherms for the van der Waals system, as in Fig. 7.1, but corrected to account for 
the Maxwell construction. The boundary of the purple shaded region is the spinodal line p* (v). The boundary of 
the orange shaded region is the stability boundary with respect to phase separation. 



Suppose we follow along an isotherm starting from the high molar volume (gas) phase. If T > T c , the volume 
v decreases continuously as the pressure p increases. If T < T c , then at the instant the isotherm first intersects 
the orange boundary curve in Fig. 7.4, there is a discontinuous change in the molar volume from high (gas) to 
low (liquid). This discontinuous change is the hallmark of a first order phase transition. Note that the volume 
discontinuity, Av = w_ oc (T c — T) 1 / 2 . This is an example of a critical behavior in which the order parameter 0, which 
in this case may be taken to be the difference <fi — v G — v L , behaves as a power law in |T — T c \, where T c is the 
critical temperature. In this case, we have (j>(T) oc (T c — T)+, where (3 = \ is the exponent, and where (T c — T) + is 
defined to be T c — T if T < T c and otherwise. The isothermal compressibility is n T = —v/p v (v, T). This is finite 
along the coexistence curve - it diverges only along the spinodal. It therefore diverges at the critical point, which 
lies at the intersection of the spinodal and the coexistence curve. 

It is convenient to express the equation of state and the coexistence curve in terms of dimensionless variables. 
Write 

- V - v - T 

p = — , v= — , T = — . (7.32) 

Pc v c T c 



The van der Waals equation of state then becomes 



ST 3 

P = iTz — 7 - ■ (7-33) 

6V — 1 U z 



Further expressing these dimensionless quantities in terms of distance from the critical point, we write 

P=1 + tt , v = l + e , f = 1 + 1 . (7.34) 

Thus, 
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Note that the LHS and the RHS of this equation vanish identically for (ir, e, t) = (0, 0, 0). We can then write 

e LG = T plfet Y 2 (_f)i/2 + p^ t 7r -5 7r eee7 r A + ( ( _ t)3 /2) (736) 

7.2.3 History of the van der Waals equation 

The van der Waals equation of state first appears in van der Waals' 1873 PhD thesis 3 , "Over de Continuiteit van den 
Gas - en Vloeistoftoestand" ("On the continuity of the gas and liquid state"). In his Nobel lecture 4 , van der Waals 
writes of how he was inspired by Rudolf Clausius' 1857 treatise on the nature of heat, where it is posited that a gas 
in fact consists of microscopic particles whizzing around at high velocities, van der Waals reasoned that liquids, 
which result when gases are compressed, also consist of 'small moving particles': "Thus I conceived the idea that 
there is no essential difference between the gaseous and the liquid state of matter. . . " 

Clausius' treatise showed how his kinetic theory of heat was consistent with Boyle's law for gases (pV = constant 
at fixed temperature), van der Waals pondered why this might fail for the non-dilute liquid phase, and he reasoned 
that there were two principal differences: inter-particle attraction and excluded volume. These considerations 
prompted him to posit his famous equation, 

RT a „_ 
P = - - • (7.37) 

v — o v 

The first term on the RHS accounts for excluded volume effects, and the second for mutual attractions. 

In the limiting case of p — ► oo, the molar volume approaches v = b. On physical grounds it seems that one should 
take b = N A ■ 8v , where v is the molecular volume. The factor of eight arises from the hard core intermolecular 
potential effective for separations less than the molecular diameter (i.e. twice the molecular radius). Expanding in 
inverse powers of v, though, 

and we read of the second virial coefficient B 2 = (b— /N A . But we have computed from the Mayer expansion 
that B 2 = Av , so the nai've guess for b is twice too large. 

Another of van der Waals' great achievements was his articulation of the law of corresponding states. Recall that 
the van der Waals equation of state, when written in terms of dimensionless quantities p = p/p c , v — v/v c , and 
T = T/T c , takes the form of eqn. 7.33. Thus, while the a and b parameters are specific to each fluid - see Tab. 
7.1 - when written in terms of these scaled dimensionless variables, the equation of state and all its consequent 
properties (i.e. the liquid-gas phase transition) are universal. 

The van der Waals equation is best viewed as semi-phenomenological. Interaction and excluded volume effects 
surely are present, but the van der Waals equation itself only captures them in a very approximate way. It is 
applicable to gases, where it successfully predicts features that are not present in ideal systems (e.g. throttling). It is 
of only qualitative and pedagogical use in the study of fluids, the essential physics of which lies in the behavior of 
quantities like the pair distribution function <?(?*). As we saw in chapter 6, any adequate first principles derivation 
of g(r) - a function which can be measured in scattering experiments - involves rather complicated approximation 
schemes to close the BBGKY hierarchy. Else one must resort to numerical simulations such as the Monte Carlo 
method. Nevertheless, the lessons learned from the van der Waals system are invaluable and they provide us 



3 Johannes Diderik van der Waals, the eldest of ten children, was the son of a carpenter. As a child he received only a primary school 
education. He worked for a living until age 25, and was able to enroll in a three-year industrial evening school for working class youth. 
Afterward he continued his studies independently, in his spare time, working as a teacher. By the time he obtained his PhD, he was 36 years 
old. He received the Nobel Prize for Physics in 1910. 

4 See http://www.nobelprize.org/nobel_prizes/physics/laureates/1910/waals-lecture.pdf 
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Figure 7.5: 'Universality' of the liquid-gas transition for eight different atomic and molecular fluids, from E. A. 
Guggenheim, /. Chem. Phys. 13, 253 (1945). Dimensionless pressure p/p c versus dimensionless density p/p c = 
v c /v is shown. The van der Waals / mean field theory gives Av = w gas — «Uq U id oc (— t) 1 / 2 , while experiments show 
a result closer to Av oc (— i) 1 / 3 . Here t = T — 1 = (T — T c )/T c is the dimensionless temperature deviation with 
respect to the critical point. 

with a first glimpse of what is going on in the vicinity of a phase transition, and how nonanalytic behavior, such 
as v G — v L oc (T c — T) 13 with noninteger exponent (3 may result due to singularities in the free energy at the critical 
point. 



7.3 Fluids, Magnets, and the Ising Model 

7.3.1 Lattice gas description of a fluid 

The usual description of a fluid follows from a continuum Hamiltonian of the form 

N 2 

i=l i<j 

The potential u(r) is typically central, depending only on the magnitude |r|, and short-ranged. Now consider a 
discretized version of the fluid, in which we divide up space into cells (cubes, say), each of which can accommo- 
date at most one fluid particle (due to excluded volume effects). That is, each cube has a volume on the order of 
a 3 , where a is the diameter of the fluid particles. In a given cube i we set the occupancy n i = 1 if a fluid particle is 
present and n i = if there is no fluid particle present. We then have that the potential energy is 



U = J2 u ( x i - x j) = \ J2 V RR' n R n R' ' 
i<j R^R' 



(7.40) 
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Figure 7.6: The lattice gas model. An occupied cell corresponds to n = 1 (a = +1), and a vacant cell to n = 

(<7=-l). 

where V RR , w v(R — R), where R k is the position at the center of cube k. The grand partition function is then 
approximated as 

(7.41) 



where 



~ (T, V, fi) « Y, II ex P - ¥ E V RR> n R n R' 

{n R } V R ' V R^R' 

£ = e »X T d a d , 



(7.42) 



where a is the side length of each cube (chosen to be on the order of the hard sphere diameter). The \ T d factor 
arises from the integration over the momenta. Note J2r u r = N is the total number of fluid particles, so 



= Z N = e^ N X T Nd a Nd 



(7.43) 



R 



Thus, we can write a lattice Hamiltonian, 

H=\Y V RR' n R n R'- k B Tln ^Y n R 
R^R' R 

= ~\ E J RR' a R a R' ~ H E a R + E ° ■' 
R^R' R 

where <j r = 2ri R — 1 is a spin variable taking the possible values { — 1, +1}, and 

J RR> = ~I V RR' 



(7.44) 



H=ik B T]nt;-±J2' V RR> 



(7.45) 



R> 



where the prime on the sum indicates that R' = R is to be excluded. For the Lennard-Jones system, V RR , = 
v(R — R) < is due to the attractive tail of the potential, hence J RR , is positive, which prefers alignment of the 
spins <j r and a R , . This interaction is therefore ferromagnetic. The spin Hamiltonian in eqn. 7.44 is known as the 
Ising model. 
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7.3.2 Phase diagrams and critical exponents 

The physics of the liquid-gas transition in fact has a great deal in common with that of the transition between a 
magnetized and unmagnetized state of a magnetic system. The correspondences are 5 

p < ► H , v < > m , 

where m is the magnetization density defined here to be the total magnetization M divided by the number of 
lattice sites iV 5 : 6 

5 5 R 

Sketches of the phase diagrams are reproduced in fig. 7.7. Of particular interest is the critical point, which occurs 
at (T c ,p c ) in the fluid system and (T C ,H C ) in the magnetic system, with H c = by symmetry. 

In the fluid, the coexistence curve in the (p, T) plane separates high density (liquid) and low density (vapor) 
phases. The specific volume v (or the density n = v^ 1 ) jumps discontinuously across the coexistence curve. In the 
magnet, the coexistence curve in the (H, T) plane separates positive magnetization and negative magnetization 
phases. The magnetization density m jumps discontinuously across the coexistence curve. For T > T c , the latter 
system is a paramagnet, in which the magnetization varies smoothly as a function of H. This behavior is most 
apparent in the bottom panel of the figure, where v(p) and m(H) curves are shown. 

For T < T c , the fluid exists in a two phase region, which is spatially inhomogeneous, supporting local regions of 
high and low density. There is no stable homogeneous thermodynamic phase for (T, v) within the two phase 
region shown in the middle left panel. Similarly, for the magnet, there is no stable homogeneous thermodynamic 
phase at fixed temperature T and magnetization m if (T, m) lies within the coexistence region. Rather, the system 
consists of blobs where the spin is predominantly up, and blobs where the spin is predominantly down. 

Note also the analogy between the isothermal compressibility n T and the isothermal susceptibility X T : 

The 'order parameter' for a second order phase transition is a quantity which vanishes in the disordered phase 
and is finite in the ordered phase. For the fluid, the order parameter can be chosen to be * oc (v va — v u ), the 
difference in the specific volumes of the vapor and liquid phases. In the vicinity of the critical point, the system 
exhibits power law behavior in many physical quantities, viz. 

m(T,H c )~(T c -T)l 
X(T,H c )~\T-T c \-"< 

(7.47) 

C M {T,H c )~\T-T c \-<* 
m{T c ,H)~±\H\ 1 ' & . 

5 One could equally well identify the second correspondence as n < ► m between density (rather than specific volume) and magnetization. 

One might object that H is more properly analogous to fi. However, since \i = fj,(p, T) it can equally be regarded as analogous to p. Note also 
that f3p = zAy d for the ideal gas, in which case 5 = z(a/X T ) d is proportional to p. 

6 Note the distinction between the number of lattice sites N s and the number of occupied cells N. According to our definitions, N = 

k(M + N s ). 
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Figure 7.7: Comparison of the liquid-gas phase diagram with that of the Ising ferromagnet. 



The quantities a, (3, j, and 5 are the critical exponents associated with the transition. These exponents satisfy certain 
equalities, such as the Rushbrooke and Griffiths relations and hyperscaling, 7 



a + 2(3 + 7 = 2 
(3 + 7 = (35 
2 — a = dv 



(Rushbrooke) 

(Griffiths) 

(hyperscaling) 



(7.48) 



Originally such relations were derived as inequalities, and only after the advent of scaling and renormalization 
group theories it was realized that they held as equalities. We shall have much more to say about critical behavior 
later on, when we discuss scaling and renormalization. 



7 In the third of the following exponent equalities, d is the dimension of space and v is the correlation length exponent. 
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7.3.3 Gibbs-Duhem relation for magnetic systems 



Homogeneity of E(S, M, N s ) means E = TS + HAI + fiN s , and, after invoking the First Law dE = TdS + H dM + 
H dN s , we have 

SdT + M dH + N s dfi = . (7.49) 



Now consider two magnetic phases in coexistence. We must have d^i 1 = d\i 2 , hence 

dfj,i = —s 1 dT — m 1 dH = —s 2 dT — m 2 dH = d/i 2 , 



(7.50) 



where m = MJ N s is the magnetization per site and s = S/ N s is the specific entropy. Thus, we obtain the Clapeyron 
equation for magnetic systems, 

'§) =--^-. (7-51) 

- dT J cocx 1 - m 2 

Thus, if m 1 ^ m 2 and (^) cocx = 0, then we must have s 1 = s 2 , which says that there is no latent heat associated with 
the transition. This absence of latent heat is a consequence of the symmetry which guarantees that F(T, H, N s ) = 
F(T,-H,N S ). 



7.3 A Order-disorder transitions 

Another application of the Ising model lies in the theory of order-disorder transitions in alloys. Examples include 
CU3AU, CuZn, and other compounds. In CuZn, the Cu and Zn atoms occupy sites of a body centered cubic (BCC) 
lattice, forming an alloy known as /3-brass. Below T c ~ 740 K, the atoms are ordered, with the Cu preferentially 
occupying one simple cubic sublattice and the Zn preferentially occupying the other. 

The energy is a sum of pairwise interactions, with a given link contributing e AA , e BB , or e AB , depending on whether 
it is an A- A, B-B, or A-B/B-A link. Here A and B represent Cu and Zn, respectively. Thus, we can write the energy 
of the link (ij) as 

(7.52) 



E- - = e P A P A + e P B P B 



£ P A P B + P B P A 

AB \ 1 J ' * I * J 



where 



Pt =1(1 + ^) = 



1 if site i contains Cu 
if site i contains Zn 



1 if site i contains Zn 
if site i contains Cu . 



The Hamiltonian is then 



{ 3 ( £ aa + £ BB - 2£ A b) crj + 3 (^aa - £ BB ) + Oj.) + i (e AA + e BB + 2e AB ) j 



= -J^2a i a j -H^2a i + E 

(ij) 



(7.53) 
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where the exchange constant J and the magnetic field H are given by 



J=We p 



H - l( £ BB 



(7.54) 



and E = |7Vz(e AA + e BB + 2e AB ), where N is the total number of lattice sites and z — 8 is the lattice coordination 
number, which is the number of nearest neighbors of any given site. 



Note that 



2e AB > s AA + e BB ==> J > (ferromagnetic) 
2e AB < e AA + e BB J < (antiferromagnetic) . 



(7.55) 



The antiferromagnetic case is depicted in fig. 7.8. 



7.4 Mean Field Theory 

Consider the Ising model Hamiltonian, 

H = -J^ViO, -H^a t , (7.56) 

where the first sum on the RHS is over all links of the lattice. Each spin can be either 'up' (a = +1) or 'down' (a = 
— 1). We further assume that the spins are located on a Bravais lattice 8 and that the coupling Jy = J(|i? ? : — Rj\), 
where Ri is the position of the i th spin. 

On each site i we decompose <r i into a contribution from its thermodynamic average and a fluctuation term, i.e. 

a i = (a,) + Sa, . (7.57) 

8 A Bravais lattice is one in which any site is equivalent to any other site through an appropriate discrete translation. Examples of Bravais 
lattices include the linear chain, square, triangular, simple cubic, face-centered cubic, etc. lattices. The honeycomb lattice is not a Bravais lattice, 
because there are two sets of inequivalent sites - those in the center of a Y and those in the center of an upside down Y. 
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We will write (a,-) = m, the local magnetization (dimensionless), and assume that m is independent of position i. 
Then 

fx j <Jj = (m + <5cTj) (m + £<7 3 -) 

= m 2 + m (<5cr ? + 5(7,(5^ (7.58) 
= — m 2 + m (a i + aj) + 5a t 5a . 

The last term on the RHS of the second equation above is quadratic in the fluctuations, and we assume this to be 
negligibly small. Thus, we obtain the mean field Hamiltonian 

H MF = \Nz3 to 2 - (H + zJm) ^ <?i , (7-59) 

i 

where N is the total number of lattice sites. The first term is a constant, although the value of m is yet to be 
determined. The Boltzmann weights are then completely determined by the second term, which is just what we 
would write down for a Hamiltonian of noninteracting spins in an effective 'mean field' 

H cS = H + zJm . (7.60) 

In other words, H cS = H cxt + H int , where the external field is applied field H ext = H, and the 'internal field' is 
i?i nt = zJm. The internal field accounts for the interaction with the average values of all other spins coupled to a 
spin at a given site, hence it is often called the 'mean field'. Since the spins are noninteracting, we have 

m = ePX^ + e-'** = t&nl \-J^T— ) ■ (7 - 61) 

It is a simple matter to solve for the free energy, given the noninteracting Hamiltonian H MF . The partition function 
is 

i / \ N 

Z = Tr cT^mf = e -20NzJm 2 1 e p(H+z.j m y \ = £ -pf _ ^ 6Tj 

We now define dimensionless variables: 

f=^- T , 9 = ^ , h=- r , (7.63) 
NzJ zJ zJ 

and obtain the dimensionless free energy 

/(m, h, 9) = \m 2 - 9 In ^ m + h )/ e + e -^+h)/o^ 
Differentiating with respect to m gives the mean field equation, 



(7.64) 



m = tanh ( — i — , (7.65) 



which is equivalent to the self-consistency requirement, m = (ai). 

7.4.1 h = Q 

When h = the mean field equation becomes 



m = tanh (J-jpj ■ (7.66) 
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magnetization m 

Figure 7.9: Results for h = 0. Upper panels: graphical solution to self -consistency equation m = tanh(m/0) at 
temperatures 9 = 0.65 (blue) and 9 = 1.5 (dark red). Lower panel: mean field free energy, with energy shifted by 
In 2 so that f(m = 0,6) = 0. 

This nonlinear equation can be solved graphically, as in the top panel of fig. 7.9. The RHS in a tanh function 
which gets steeper with decreasing t. If, at m = 0, the slope of tanh(m/0) is smaller than unity, then the curve 
y = tanh(m//i) will intersect y = m only at m = 0. However, if the slope is larger than unity, there will be three 
such intersections. Since the slope is 1/0, we identify 9 C = 1 as the mean field transition temperature. 

In the low temperature phase 9 < 1, there are three solutions to the mean field equations. One solution is always 
at m = 0. The other two solutions must be related by the m <-> — m symmetry of the free energy (when h = 0). 
The exact free energies are plotted in the bottom panel of fig. 7.9, but it is possible to make analytical progress by 
assuming m is small and Taylor expanding the free energy f(m, 9) in powers of m: 



f(m,9) = \m 2 -01n2-0 In cosh 
= -0 ln2+ |(1 -0" x )m : 



(7.67) 



12 a 



45 9 5 



Note that the sign of the quadratic term is positive for 9 > 1 and negative for 9 < 1 . Thus, the shape of the free 
energy f(m, 9) as a function of m qualitatively changes at this point, 9 C = 1, the mean field transition temperature, 
also known as the critical temperature. 

For 9 > 9 C , the free energy f(m, 9) has a single minimum at m = 0. Below C , the curvature at m = reverses, and 
m = becomes a local maximum. There are then two equivalent minima symmetrically displaced on either side 
of m = 0. Differentiating with respect to m, we find these local minima. For 9 < C , the local minima are found at 



m 



30 2 (1 - 9) = 3(1 - 9) + C((l - 9) 2 ) 



(7.68) 
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Thus, we find for |0 - 1| < I, 

m{9, h = 0) = ±V3 (l - 0) J_ /2 , (7.69) 

where the + subscript indicates that this solution is only for 1 — > 0. For 6 > 1 the only solution is m = 0. The 
exponent with which m(0) vanishes as — > 0~ is denoted /3. I.e. m(0, ft. = 0) oc (0 C — 0)(L 

7.4.2 Specific heat 

We can now expand the free energy f(9, ft = 0). We find 

' -9 In 2 if > 0, 

-01n2-|(l-0) 2 + O((l-0) 4 ) if0<0 

Thus, if we compute the heat capacity we find in the vicinity of = 0, 

n d 2 f fo if0>6 



f(6,h = 0) = { n ,__ n 3 „^ _ c (7.70) 



90 2 1§ if0<0 c 



(7.71) 



Thus, the specific heat is discontinuous at = C . We emphasize that this is only valid near 9 = 9 C = 1. The general 
result valid for all is 9 

1 m 2 (0)-m 4 (0) 
1 + m 2 (0) ' 

With this expression one can check both limits 0^0 and — > C . As — > the magnetization saturates and one 
has m 2 (0) ~ 1 — 4e~ 2 / e . The numerator then vanishes as e~ 2 / 6 , which overwhelms the denominator that itself 
vanishes as 2 . As a result, c v (9 — > 0) = 0, as expected. As — > 1, invoking m 2 ~ 3(1 — 0) we recover c v 



3 
2 • 



In the theory of critical phenomena, c v (9) oc |0 — C | a as — > C . We see that mean field theory yields a = 0. 
7.4.3 ft ^ 

Consider without loss of generality the case h > 0. The minimum of the free energy /(m, h, 9) now lies at m > 
for any 9. At low temperatures, the double well structure we found in the h = case is tilted so that the right 
well lies lower in energy than the left well. This is depicted in fig. 7.10. As the temperature is raised, the local 
minimum at m < vanishes, annihilating with the local maximum in a saddle-node bifurcation. To find where 

this happens, one sets = and = simultaneously, resulting in 



1 + 



n0) = VT^--ln( J -^ T =). (7.73) 

The solutions lie at ft, = ±h*(9). For < C = 1 and h g \—h*(6) , +h*(6)], there are three solutions to the mean 
field equation. Equivalently we could in principle invert the above expression to obtain 0* (h). For > 0* (h), there 
is only a single global minimum in the free energy f(m) and there is no local minimum. Note 9*(h = 0) = 1. 

Assuming h <C |0 — 1| <C 1, the mean field solution for m(0, h) will also be small, and we expand the free energy 
in m, and to linear order in h: 

f(m, h, 9) = -9 In 2 + 1(1 - £T*) m 2 + ^ - ^ (? 74) 

= /o + I i° - l ) ™ 2 + T2 mi -hm + ... . 



To obtain this result, one writes / = f(9, m{&)\ and then differentiates twice with respect to 9, using the chain rule. Along the way, any 

as 



naked (i.e. undifferentiated) term proportional to J^- may be dropped, since this vanishes at any 6 by the mean field equation. 
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-0.1 
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magnetization m 



Figure 7.10: Results for h = 0.1. Upper panels: graphical solution to self-consistency equation m = tanh((m + 
h)/8) at temperatures 8 = 0.65 (blue), 8 = 0.9 (dark green), and 8 = 1.5 (dark red). Lower panel: mean field free 
energy, with energy shifted by 9 In 2 so that f(m = 0,8) = 0. 



Setting = 0, we obtain 

im 3 + (6-l)-m-h = 0. (7.75) 

If 8 > 1 then we have a solution m = h/{8 — 1). The m 3 term can be ignored because it is higher order in h, and 
we have assumed h -C \6 — 1| -C 1. This is known as the Curie-Weiss law 10 . The magnetic susceptibility behaves as 

^) = ^~-|.-ir, (7.76) 

where the magnetization critical exponent 7 is 7 = 1. If 6 < 1 then while there is still a solution at to = hj(8 — 1), 
it lies at a local maximum of the free energy, as shown in fig. 7.10. The minimum of the free energy occurs close 
to the h = solution to = to (6>) ee v^(l - 8), and writing to = to + 5m we find to linear order in h as 
Sm(8, h) = h/2(l - 8). Thus, 

m (e,h) = V3(l-6) + ^--. (7.77) 

10 Pierre Curie was a pioneer in the fields of crystallography, magnetism, and radiation physics. In 1880, Pierre and his older brother Jacques 
discovered piezoelectricity. He was 21 years old at the time. It was in 1895 that Pierre made the first systematic studies of the effects of 
temperature on magnetic materials, and he formulated what is known as Curie's Law, X = C/T, where C is a constant. Curie married Marie 
Sklodowska in the same year. Their research turned toward radiation, recently discovered by Becquerel and Rontgen. In 1898, Pierre and 
Marie Curie discovered radium. They shared the 1903 Nobel Prize in Physics with Becquerel. Marie went on to win the 1911 Nobel Prize in 
Chemistry and was the first person ever awarded two Nobel Prizes. Their daughter Irene Joliot Curie shared the 1935 Prize in Chemistry (with 
her husband), also for work on radioactivity. Pierre Curie met an untimely and unfortunate end in the Spring of 1906. Walking across the 
Place Dauphine, he slipped and fell under a heavy horse-drawn wagon carrying military uniforms. His skull was crushed by one of the wagon 
wheels, killing him instantly. Later on that year, Pierre-Ernest Weiss proposed a modification of Curie's Law to account for ferromagnetism. 
This became known as the Curie- Weiss law, X = C/(T — T c ). 
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2D Ising 


3D Ising 


co 2 


Exponent 


MFT 


(exact) 


(numerical) 


(expt.) 


a 








0.125 


£0.1 


a 


1/2 


1/8 


0.313 


0.35 


i 


1 


7/4 


1.25 


1.26 


5 


3 


15 


5 


4.2 



Table 7.2: Critical exponents from mean field theory as compared with exact results for the two-dimensional Ising 
model, numerical results for the three-dimensional Ising model, and experiments on the liquid-gas transition in 
CO2. Source: H. E. Stanley, Phase Transitions and Critical Phenomena. 



Once again, we find that X(9) diverges as \9 — 1| 7 with 7 = 1. The exponent 7 on either side of the transition is 
the same. 

Finally, we can set 9 = 9 C and examine m(h). We find, from eqn. 7.76, 

m(9 = e c ,h) = (3/i) 1/3 oc h 1/s , (7.78) 

where S is a new critical exponent. Mean field theory gives (5 = 3. Note that at 6 = 6 C = 1 we have m = tanh(m+/i), 
and inverting we find 

h{m,e = 6 c ) = \J\±^)-m= r ^ + r ^ + ... , (7.79) 

z \ 1 — my 3 5 

which is consistent with what we just found for m(h,6 = 9 C ). 

How well does mean field theory do in describing the phase transition of the Ising model? In table 7.2 we compare 
our mean field results for the exponents a, (3, 7, and S with exact values for the two-dimensional Ising model, 
numerical work on the three-dimensional Ising model, and experiments on the liquid-gas transition in CO2 . The 
first thing to note is that the exponents are dependent on the dimension of space, and this is something that mean 
field theory completely misses. In fact, it turns out that the mean field exponents are exact provided d > d u , where 
d u is the upper critical dimension of the theory. For the Ising model, d u = 4, and above four dimensions (which is of 
course unphysical) the mean field exponents are in fact exact. We see that all in all the MFT results compare better 
with the three dimensional exponent values than with the two-dimensional ones - this makes sense since MFT 
does better in higher dimensions. The reason for this is that higher dimensions means more nearest neighbors, 
which has the effect of reducing the relative importance of the fluctuations we neglected to include. 



7.4.4 Magnetization dynamics 

Dissipative processes drive physical systems to minimum energy states. We can crudely model the dissipative 
dynamics of a magnet by writing the phenomenological equation 



dm df 
ds dm ' 

where s is a dimensionless time variable. Under these dynamics, the free energy is never increasing: 

£ = df_ dm = J df_ \ 2 < Q 
ds dm ds \ dm J ~ 

Clearly the fixed point of these dynamics, where m = 0, is a solution to the mean field equation §^ =0. 



(7.80) 



(7.81) 
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Figure 7.11: Dissipative magnetization dynamics rh = —f'{m). Bottom panel shows h*(9) from eqn. 7.73. For 
(9, h) within the blue shaded region, the free energy /(m) has a global minimum plus a local minimum and a 
local maximum. Otherwise f{m) has only a single global minimum. Top panels show an imperfect bifurcation 
in the magnetization dynamics at h = 0.0215 , for which 9* = 0.90. Temperatures shown: 8 = 0.65 (blue), 
6 = 9* (h) = 0.90 (green), and 6 = 1.2. The rightmost stable fixed point corresponds to the global minimum of the 
free energy. The bottom of the middle two upper panels shows h = 0, where both of the attractive fixed points 
and the repulsive fixed point coalesce into a single attractive fixed point (supercritical pitchfork bifurcation). 

The phase flow for the equation m = — /'(m) is shown in fig. 7.11. As we have seen, for any value of h there is 
a temperature 8* below which the free energy f(m) has two local minima and one local maximum. When h = 
the minima are degenerate, but at finite h one of the minima is a global minimum. Thus, for 9 < 9*{h) there 
are three solutions to the mean field equations. In the language of dynamical systems, under the dynamics of 
eqn. 7.80, minima of f(m) correspond to attractive fixed points and maxima to repulsive fixed points. If h > 0, 
the rightmost of these fixed points corresponds to the global minimum of the free energy. As 9 is increased, 
this fixed point evolves smoothly. At 9 = 9* , the (metastable) local minimum and the local maximum coalesce 
and annihilate in a saddle-note bifurcation. However at h = all three fixed points coalesce at 9 = 9 C and the 
bifurcation is a supercritical pitchfork. As a function of t at finite h, the dynamics are said to exhibit an imperfect 
bifurcation, which is a deformed supercritical pitchfork. 

The solution set for the mean field equation is simply expressed by inverting the tanh function to obtain h(9, m). 
One readily finds 

h(6, m) = - In ( l + - m . (7.82) 
2 \ 1 — m J 

As we see in the bottom panel of fig. 7.12, m(h) becomes multivalued for h G [ — h*{9) , +h*(9)], where h*{9) 
is given in eqn. 7.73. Now imagine that 9 < 9 C and we slowly ramp the field h from a large negative value to 
a large positive value, and then slowly back down to its original value. On the time scale of the magnetization 
dynamics, we can regard h(s) as a constant. (Remember the time variable is s here.) Thus, m(s) will flow to the 
nearest stable fixed point. Initially the system starts with m = — 1 and h large and negative, and there is only 
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Figure 7.12: Top panel : hysteresis as a function of ramping the dimensionless magnetic field h at 9 = 0.40. Dark 
red arrows below the curve follow evolution of the magnetization on slow increase of h. Dark grey arrows above 
the curve follow evolution of the magnetization on slow decrease of h. Bottom panel : solution set for m(8, h) as 
a function of h at temperatures 9 = 0.40 (blue), 9 = 9 C = 1.0 (dark green), and t = 1.25 (red). 



one fixed point, at m* ~ — 1. As h slowly increases, the fixed point value m* also slowly increases. As h exceeds 
— h*(8), a saddle-node bifurcation occurs, and two new fixed points are created at positive m, one stable and one 
unstable. The global minimum of the free energy still lies at the fixed point with m* < 0. However, when h 
crosses h = 0, the global minimum of the free energy lies at the most positive fixed point m* . The dynamics, 
however, keep the system stuck in what is a metastable phase. This persists until h = +h*(9), at which point 
another saddle-note bifurcation occurs, and the attractive fixed point at m* < annihilates with the repulsive 
fixed point. The dynamics then act quickly to drive m to the only remaining fixed point. This process is depicted 
in the top panel of fig. 7.12. As one can see from the figure, the the system follows a stable fixed point until the 
fixed point disappears, even though that fixed point may not always correspond to a global minimum of the free 
energy. The resulting m(h) curve is then not reversible as a function of time, and it possesses a characteristic shape 
known as a hysteresis loop. Etymologically, the word hysteresis derives from the Greek vareprjan,, which means 
'lagging behind'. Systems which are hysteretic exhibit a history-dependence to their status, which is not uniquely 
determined by external conditions. Hysteresis may be exhibited with respect to changes in applied magnetic field, 
changes in temperature, or changes in other externally determined parameters. 

7.4.5 Beyond nearest neighbors 

Suppose we had started with the more general model, 

(7.83) 

i=/Lj i 
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where Jij is the coupling between spins on sites i and j. In the top equation above, each pair (ij) is counted once 
in the interaction term; this may be replaced by a sum over all i and j if we include a factor of The resulting 
mean field Hamiltonian is then 

H MF = ±NJ(0) m 2 -{H + J(0) to) ^ a t . (7.84) 

i 

Here, J(q) is the Fourier transform of the interaction matrix J^' 

J(q) = J2 J ( R ) e ~ i9 ' R ■ ( 7 " 85 ) 
R 

For nearest neighbor interactions only one has J(0) = zJ, where z is the lattice coordination number, i.e. the number 
of nearest neighbors of any given site. The scaled free energy is as in eqn. 7.64, with / = F/NJ(Q), 9 = fc B T/J(0), 
and h = H/ Jifi). The analysis proceeds precisely as before, and we conclude 9 C = 1, i.e. fc B T c MF = J(0). 



7.4.6 Ising model with long-ranged forces 



Consider an Ising model where — J/N for all i and j, so that there is a very weak interaction between every 
pair of spins. The Hamiltonian is then 



2N 

The partition function is 

Z = Tr {cr . } exp 
We now invoke the Gaussian integral, 



2N 



(7.86) 



(7.87) 



dx e~ ax - px 



(7.88) 



Thus, 



exp 



9N [ Z^ a > 



= (^fj /2 [dme-^ Nf3Jm2+ P Jm ^°> 



and we can write the partition function as 



Z = 



Nf3J 
2tt 



1/2 



OO 



— oo 
oo 



(7.89) 



(7.90) 



11 The self-interaction terms with i = j contribute a constant to H and may be either included or excluded. However, this property only 
pertains to the u i = ±1 model. For higher spin versions of the Ising model, say where S, S { — 1, 0, +1}, then Sf is not constant and we 
should explicitly exclude the self-interaction terms. 

12 The sum in the discrete Fourier transform is over all 'direct Bravais lattice vectors' and the wavevector q may be restricted to the 'first 
Brillouin zone'. These terms are familiar from elementary solid state physics. 
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where 6 = k B T/J, h = H/J, and 



A{m) 



- 6» In 



2 cosh 



(7.91) 



Since N — > oo, we can perform the integral using the method of steepest descents. Thus, we must set 

' m* + /i s 



dA 
dm 







m* = tanh 



Expanding about m = m* , we write 



Aim) = A(m*) + \A"{m*) (m - m*) 2 + i A"'(m*) (m - m*) 3 + 



Performing the integrations, we obtain 

1/2 



JVi"(m*) 2 7VA'"(m*) , 

JTl 771 + 



2^ 



G6» 



y/A"(m*) 

The corresponding free energy per site 



^ =e -NA(mn/e. (i + orjv- 1 )) 



/ = — = A(m*) + — lnA"(m*) + 0{N~ 2 ) , 



(7.92) 



(7.93) 



(7.94) 



(7.95) 



where m* is the solution to the mean field equation which minimizes A(m). Mean field theory is exact for this 
model! 



7.5 Variational Density Matrix Method 



7.5.1 The variational principle 



Suppose we are given a Hamiltonian H. From this we construct the free energy, F: 

F = E-TS 

= Ti(qH) + k B TTr(g \ng) . 



(7.96) 



Here, g is the density matrix 1 ^. A physical density matrix must be (i) normalized {i.e. Tr g = 1), (ii) Hermitian, and 
(iii) non-negative definite (i.e. all the eigenvalues of g must be non-negative). 

Our goal is to extremize the free energy subject to the various constraints on g. Let us assume that g is diagonal in 
the basis of eigenstates of H, i.e. 

7 



13 How do we take the logarithm of a matrix? The rule is this: A = In B if B = exp(^l). The exponential of a matrix may be evaluated via 
its Taylor expansion. 
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where P 7 is the probability that the system is in state | 7 ). Then 

f = E i p f + K T R > ln p -y ■ ( 7 - 98) 

7 7 

Thus, the free energy is a function of the set {P 7 }. We now extremize F subject to the normalization constraint. 
This means we form the extended function 

F*({P 7 },A)=P({P 7 })+a(]Tp 7 -i) , (7.99) 

7 

and then freely extremize over both the probabilities {-P 7 } as well as the Lagrange multiplier A. This yields the 
Boltzmann distribution, 

= i exp(-P 7 /fc B T) , (7.100) 

where Z = J2 7 e~ E " r ' k ^ T = Tr er H l k ^ T is the canonical partition function, which is related to A through 

A = k B T (ln Z — 1) . (7.101) 
Note that the Boltzmann weights are, appropriately, all positive. 

If the spectrum of H is bounded from below, our extremum should in fact yield a minimum for the free energy 
F. Furthermore, since we have freely minimized over all the probabilities, subject to the single normalization 
constraint, any distribution {P 7 } other than the equilibrium one must yield a greater value ofF. 

Alas, the Boltzmann distribution, while exact, is often intractable to evaluate. For one-dimensional systems, there 
are general methods such as the transfer matrix approach which do permit an exact evaluation of the free energy. 
However, beyond one dimension the situation is in general hopeless. A family of solvable ("integrable") mod- 
els exists in two dimensions, but their solutions require specialized techniques and are extremely difficult. The 
idea behind the variational density matrix approximation is to construct a tractable trial density matrix q which 
depends on a set of variational parameters {x a }, and to minimize with respect to this (finite) set. 

7.5.2 Variational density matrix for the Ising model 

Consider once again the Ising model Hamiltonian, 

H = -J2 J y ^ °V - H ai ■ ( 7102 ) 

i<j i 

The states of the system | 7 ) may be labeled by the values of the spin variables: | 7 ) < — ► | o\ , er 2 , • ■ ■ ) ■ We assume 
the density matrix is diagonal in this basis, i.e. 

QN ( 1 \y) = e ( 1 )6^ l , (7.103) 

where 

&r.7'=II^- (7104) 

i 

Indeed, this is the case for the exact density matrix, which is to say the Boltzmann weight, 

QN {a u a 2 ,...) = ]-e-^^-^ . (7.105) 
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We now write a trial density matrix which is a product over contributions from independent single sites: 

Q N (<ri,<r2,---) =JJe(o-») , (7.106) 

where 

- *-.x + C-^) S.,-1 (7-107) 



Note that we've changed our notation slightly. We are denoting by g(a) the corresponding diagonal element of 
the matrix 

e = ( o 4v ' (7 ' 108) 

and the full density matrix is a tensor product over the single site matrices: 

g N = g® q® ■■ -® g . (7.109) 

Note that £ and hence g N are appropriately normalized. The variational parameter here is m, which, if p is to be 
non-negative definite, must satisfy — 1 < m < 1. The quantity m has the physical interpretation of the average 
spin on any given site, since 

We may now evaluate the average energy: 

E = Tr (g N H) = - ^2 J y m 2 - H^m 

i<j i 

= — |iVJ(0) m 2 - NHm , (7.111) 
where once again J(0) is the discrete Fourier transform of J(R) at wavevector q = 0. The entropy is given by 

S = -k B Tr(g N \ng N ) = —Nk B Tr (g In g) 



-Nk 



We now define the dimensionless free energy per site: / = F/NJ(0). We have 

/<„, h,S) - - W - *,» + . { (1±2) ,» (1±2) + (i^) ,» (i^) } 



(7.113) 



where = fe B T/j(0) is the dimensionless temperature, and /i = H/j(0) the dimensionless magnetic field, as 
before. We extremize /(m) by setting 

£-0~»-* + Stafi±=). (7-114) 



9to 2 V 1 — m 

Solving for to, we obtain 



to = tanh [ I , (7.115) 



which is precisely what we found in eqn. 7.65. 
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Figure 7.13: Variational field free energy A/ = /(to, h, 9) + 9 In 2 versus magnetization to at six equally spaced 
temperatures interpolating between 'high' (9 = 1.25, red) and 'low' (9 = 0.75, blue) values. Top panel: h = 0. 
Bottom panel: h = 0.06. 

Note that the optimal value of m indeed satisfies the requirement |m| < 1 of non-negative probability. This 
nonlinear equation may be solved graphically. For h = 0, the unmagnetized solution m = always applies. 
However, for 9 < 1 there are two additional solutions at to = ±m A (0), with m A (9) = y/3(l — 9) + OUl — 9) 3 ^ 2 ) 
for t close to (but less than) one. These solutions, which are related by the Z2 symmetry of the h = model, are 
in fact the low energy solutions. This is shown clearly in figure 7.13, where the variational free energy /(to, t) is 
plotted as a function of m for a range of temperatures interpolating between 'high' and Tow' values. At the critical 
temperature 9 C = 1, the lowest energy state changes from being unmagnetized (high temperature) to magnetized 
(low temperature). 

For h > 0, there is no longer a Z2 symmetry (i.e. <ji — > — cr, V i). The high temperature solution now has m > 
(or to < if h < 0), and this smoothly varies as t is lowered, approaching the completely polarized limit m = 1 
as 9 — > 0. At very high temperatures, the argument of the tanh function is small, and we may approximate 
tanh(x) ~ x, in which case 




(7.116) 



This is called the Curie-Weiss law. One can infer 9 C from the high temperature susceptibility X(9) = {dm/dh) h=0 
by plotting X -1 versus 9 and extrapolating to obtain the ^-intercept. In our case, X(9) = (6 — C ) -1 . For low 9 and 
weak h, there are two inequivalent minima in the free energy. 

When to is small, it is appropriate to expand / (to, h, 9), obtaining 

f{m,h,6) = -9 ln2- hm + ± (6 - 1) to 2 + A m 4 + JL m e + ^ TO « + _ _ _ _ (7.117) 

This is known as the Landau expansion of the free energy in terms of the order parameter to. An order parameter is a 
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thermodynamic variable <j> which distinguishes ordered and disordered phases. Typically <fr = in the disordered 
(high temperature) phase, and <f> ^ in the ordered (low temperature) phase. When the order sets in continuously, 
i.e. when <f> is continuous across 9 C , the phase transition is said to be second order. When <fi changes abruptly, the 
transition is first order. It is also quite commonplace to observe phase transitions between two ordered states. For 
example, a crystal, which is an ordered state, may change its lattice structure, say from a high temperature tetrag- 
onal phase to a low temperature orthorhombic phase. When the high T phase possesses the same symmetries 
as the low T phase, as in the tetragonal-to-orthorhombic example, the transition may be second order. When the 
two symmetries are completely unrelated, for example in a hexagonal-to-tetragonal transition, or in a transition 
between a ferromagnet and an antiferromagnet, the transition is in general first order. 

Throughout this discussion, we have assumed that the interactions Jij are predominantly ferromagnetic, i.e. Jy > 0, 
so that all the spins prefer to align. When Jy < 0, the interaction is said to be antiferromagnetic and prefers anti- 
alignment of the spins {i.e. <Tj <?j = —1). Clearly not every pair of spins can be anti-aligned - there are two possible 
spin states and a thermodynamically extensive number of spins. But on the square lattice, for example, if the only 
interactions Jij are between nearest neighbors and the interactions are antiferromagnetic, then the lowest energy 
configuration (T = ground state) will be one in which spins on opposite sublattices are anti-aligned. The square 
lattice is bipartite - it breaks up into two interpenetrating sublattices A and B (which are themselves square lattices, 
rotated by 45° with respect to the original, and with a larger lattice constant by a factor of \J%, such that any site in 
A has nearest neighbors in B, and vice versa. The honeycomb lattice is another example of a bipartite lattice. So is 
the simple cubic lattice. The triangular lattice, however, is not bipartite (it is tripartite). Consequently, with nearest 
neighbor antiferromagnetic interactions, the triangular lattice Ising model is highly frustrated. The moral of the 
story is this: antiferromagnetic interactions can give rise to complicated magnetic ordering, and, when frustrated 
by the lattice geometry, may have finite specific entropy even at T = 0. 



7.5.3 Mean Field Theory of the Potts Model 

The Hamiltonian for the Potts model is 

i<j i 

Here, Oj g {1, . . . , q], with integer q. This is the so-called 'g-state Potts model'. The quantity H is analogous to an 
external magnetic field, and preferentially aligns (for H > 0) the local spins in the a = 1 direction. We will assume 
H > 0. 

The g-component set is conveniently taken to be the integers from 1 to q, but it could be anything, such as 

Oj G {tomato, penny, ostrich, Grateful Dead ticket from 1987, . . .} . (7.119) 

The interaction energy is — Jij if sites i and j contain the same object (q possibilities), and if i and j contain 
different objects (q 2 — q possibilities). 

The two-state Potts model is equivalent to the Ising model. Let the allowed values of a be ±1. Then the quantity 

<W = \ + \o°' (7.120) 

equals 1 if a = a', and is zero otherwise. The three-state Potts model cannot be written as a simple three-state 
Ising model, i.e. one with a bilinear interaction a a' where a G {— 1, 0, +1}. However, it is straightforward to verify 
the identity 

5a,*> = 1 + 5 oa' + | (iV 2 - {a 2 + a 12 ) . (7.121) 

Thus, the q = 3-state Potts model is equivalent to a S = 1 (three-state) Ising model which includes both bilinear 
(era') and biquadratic (crV 2 ) interactions, as well as a local field term which couples to the square of the spin, 
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a 2 . In general one can find such correspondences for higher q Potts models, but, as should be expected, the 
interactions become increasingly complex, with bi-cubic, bi-quartic, bi-quintic, etc. terms. 

Getting back to the mean field theory, we write the single site variational density matrix g as a diagonal matrix 
with entries 

'1 -x s 



g(a) = xS a ,i + 



(7.122) 



with g N (a 1 , . . . , a N ) = 0(0^) ■ • • g(a N ). Note that Tr (g) = 1. The variational parameter is x. When x = q~ x , all 
states are equally probable. But for x > q~ x , the state a = 1 is preferred, and the other (q — 1) states have identical 
but smaller probabilities. It is a simple matter to compute the energy and entropy: 



E = Tr (g N H) = -±NJ(0)^x 2 + ^ ^ j 
S = —k B Tr (g N In g N ) = —Nk B |x lux + (1 — x) In 
The dimensionless free energy per site is then 



f{x,e,h) 



(1 - xf 

q-i 



+ 9 < xlnx + (1 - a:) In 



- NHx 

1 - x 
a- 1 



1 - X 

~ 1 



(7.123) 



hx 



where h = H/J(0). We now extremize with respect to x to obtain the mean field equation, 



df 

dx 



= -x 



l-x 
~ 1 



\nx-9 In 



l-x 



- h 



(7.124) 



(7.125) 



Note that for h = 0, x = q 1 is a solution, corresponding to a disordered state in which all states are equally 
probable. At high temperatures, for small h, we expect x — q~ 



x = q 1 



cx h. Indeed, using Mathematica one can set 
s , (7.126) 
and expand the mean field equation in powers of s. One obtains 

q 3 (q - 2) I 



For weak fields, \ h\ <c 1, and we have 



g-i 



s(6) 



2(q-lf 
(q-l)h 



(7.127) 



0(h 2 



q ( q e-i) 

which again is of the Curie- Weiss form. The difference s = x — q^ 1 is the order parameter for the transition. 
Finally, one can expand the free energy in powers of s, obtaining the Landau expansion, 

2/j + l „f„fl_1\ ^_o^3 



(7.128) 



f{s,9,h) 



2q 



g(gg-l) a (q-2)q 
> In q — ns + —r-, — s 



2(9-1) 



12 



l+(«-l) 

.5 



q- 

30 



20 

l + (q- I) -5 



6(g- l) 2 

i-( g -ir 4 



(7.129) 



Note that, for q = 2, the coefficients of s 3 , s 5 , and higher order odd powers of s vanish in the Landau expansion. 
This is consistent with what we found for the Ising model, and is related to the Z 2 symmetry of that model. For 
q > 3, there is a cubic term in the mean field free energy, and thus we generically expect a first order transition, as 
we shall see below when we discuss Landau theory. 
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7.5.4 Mean Field Theory of the XY Model 



Consider the so-called XY model, in which each site contains a continuous planar spin, represented by an angular 
variable^ G [— ir, tt]: 

H = — Jij cos (0i — 4>j) — H cos (f>i . (7.130) 



We write the (diagonal elements of the) full density matrix once again as a product: 

£^(01,^2, ■ ■ ■) = J|£»(0i) • 



(7.131) 



Our goal will be to extremize the free energy with respect to the function g(4>). To this end, we compute 



E = Tr(g N H) = -iNJ(0) 



Tr (pe**) 



-NHTr (q coscp) 



The entropy is 



S = -Nk B Jr(g Ing) 



Note that for any function A(<fi), we have 



14 



7T 

Tr{ e A)^ e^)A{4>). 



(7.132) 
(7.133) 

(7.134) 



We now extremize the functional F[g(</>)] = E — TS with respect to g((j>), under the condition that Tr g = 1. We 
therefore use Lagrange's method of undetermined multipliers, writing 

F = F — Nk B T A ( Tr g - 1 



(7.135) 

Note that F is a function of the Lagrange multiplier A and afunctional of the density matrix g(<j>). The prefactor 
Nk B T which multiplies A is of no mathematical consequence - we could always redefine the multiplier to be A' = 
Nk B TX. It is present only to maintain homogeneity and proper dimensionality of F* with A itself dimensionless 
and of order TV . We now have 



SF 



5g((f>) 5g((f>) 



- i7VJ(0) Tr(£>e # ) - NHTr(g cos 



Nk B T Tr (g In g) - Nk B T A Tr g - 1 



To this end, we note that 



Thus, we have 



SF 
5g(cj>) 



8g{4>) 



iNJ(0) • - 



Trfee**') e" # + Tr( 



Nk„T ■ — 
2?r 



In + 1 



cos<^ 
"2tT 



iVfc R T • — 
2tt 



(7.136) 



(7.137) 



14 The denominator of 27r in the measure is not necessary, and in fact it is even slightly cumbersome. It divides out whenever we take a ratio 
to compute a thermodynamic average. I introduce this factor to preserve the relation Tr 1 = 1. I personally find unnormalized traces to be 
profoundly unsettling on purely aesthetic grounds. 
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Now let us define 

Tr(^e^) = J^- q{4>) e** = me 1 ^ . (7.138) 

We then have 

In q(4>) = y^rn cos(0 - O ) + JL cos<£ + A - 1. (7.139) 

Clearly the free energy will be reduced if O = so that the mean field is maximal and aligns with the external 
field, which prefers = 0. Thus, we conclude 

Q(<t>)=C exp(^f cos^ , (7.140) 

where 

H cS = J(0) m + H (7.141) 
and C = e . The value of A is then determined by invoking the constraint, 

Tr q = 1 = cf^- cxp cos A = C I (H eS /k B T) , (7.142) 

where Iq(z) is the Bessel function. We are free to define 

e= ^§ , (7.143) 

and to treat e as our single variational parameter. 
We then have the normalized density matrix 

„£COS0 p£ COS <f> 

B {4>) = - = -— . (7.144) 

r^l e ecos0' J( H e J 

— 7T 

We next compute the following averages: 

7T 

,±t<4\ „/iA„±# _ J l( £ ) 



<e^) = y^^)e^ = ^ (7.145) 

— 7T 

<cos(0 - 0')) = Re (e* e"**') = , (7.146) 



as well as 



Tr( e ln e ) = /^-— T i £C os</)-lnIo(e)}=e 7 1 H-ln/o(£). (7.147) 

— 7T 

The dimensionless free energy per site is therefore 



f(e,h,8) = -| f ^fl) 2 + M - ft) ^fl - ln W , (7.148) 



with (9 = k B T/J(0) and ft = ff/J(0) and / = F/NJ(0) as before. 
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For small e, we may expand the Bessel functions, using 

k\T{k + v + l) ' 



°° (-z 2 ) k 



k=0 

to obtain 

/(e, ft, 6) = \ (6 - |) e 2 + £ (2 - 39) e 4 - § he + £ he 3 + . . . . (7.150) 

This predicts a second order phase transition at 9 C = i. 15 Note also the Curie- Weiss form of the susceptibility at 
high 9: 



7.6 Landau Theory of Phase Transitions 

Landau's theory of phase transitions is based on an expansion of the free energy of a thermodynamic system in 
terms of an order parameter, which is nonzero in an ordered phase and zero in a disordered phase. For example, 
the magnetization M of a ferromagnet in zero external field but at finite temperature typically vanishes for tem- 
peratures T > T c , where T c is the critical temperature, also called the Curie temperature in a ferromagnet. A low 
order expansion in powers of the order parameter is appropriate sufficiently close to the phase transition, i.e. at 
temperatures such that the order parameter, if nonzero, is still small. 

The simplest example is the quartic free energy, 

f(m, h = 0,6) = f + \am 2 + \bm 4 , (7.152) 

where /o = fo{0), a = a(9), and b = b(9). Here, 9 is a dimensionless measure of the temperature. If for example 
the local exchange energy in the ferromagnet is J, then we might define 9 = k B T / zJ , as before. Let us assume 
b > 0, which is necessary if the free energy is to be bounded from below 16 . The equation of state , 

^- = = am + bm 3 , (7.153) 

dm 

has three solutions in the complex m plane: (i) m = 0, (ii) m = yj—a/b, and (iii) m = — y/ '—a/b. The latter two 
solutions lie along the (physical) real axis if a < 0. We assume that there exists a unique temperature 9 C where 
a(9 c ) = 0. Minimizing /, we find 

a 2 

0<9 C : f(9)=f -- (7154) 
9 > 9 C : f(9) - f . 

The free energy is continuous at 9 C since a(6 c ) = 0. The specific heat, however, is discontinuous across the transi- 
tion, with 

„ d 2 



iOt) - c{9~) 



89 2 



46 J 2b(9 c ) 



'(So)] 2 

(7.155) 



15 Note that the coefficient of the quartic term in e is negative for 9 > |. At 9 = 9 C = ~, the coefficient is positive, but for larger 9 one must 
include higher order terms in the Landau expansion. 

16 It is always the case that / is bounded from below, on physical grounds. Were b negative, we'd have to consider higher order terms in the 
Landau expansion. 
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Figure 7.14: Phase diagram for the quartic mean field theory / = /q + \am 2 + jbm A — hm, with b > 0. There is 
a first order line at ft. = extending from a = — oo and terminating in a critical point at a = 0. For \h\ < h*(a) 
(dashed red line) there are three solutions to the mean field equation, corresponding to one global minimum, one 
local minimum, and one local maximum. Insets show behavior of the free energy f(m). 



The presence of a magnetic field h breaks the Z 2 symmetry of : 



-m. The free energy becomes 



/(m, h, 0) = fo + ^am 2 + \bm A — hm 



and the mean field equation is 



bm d 



am 







(7.156) 



(7.157) 



This is a cubic equation for m with real coefficients, and as such it can either have three real solutions or one real 
solution and two complex solutions related by complex conjugation. Clearly we must have a < in order to have 



three real roots, since bm 3 



is monotonically increasing otherwise. The boundary between these two classes 



of solution sets occurs when two roots coincide, which means f"(m) 
solving these two equations, we find 



or, equivalently, 



_2 {-af' 2 

W ~ 3 3 / 2 &V2 



a(h) = ^b^\h\ 2 '\ 



as well as f'(m) = 0. Simultaneously 



(7.158) 



(7.159) 



If, for fixed h, we have a < a*(h), then there will be three real solutions to the mean field equation f'(m) = 0, one 
of which is a global minimum (the one for which m • h > 0). For a > a* (h) there is only a single global minimum, 
at which m also has the same sign as h. If we solve the mean field equation perturbatively in h/a, we find 



m(a, h) = t h 

a a 4 



±- 



11/2 



h 3h 1 /2 



&V 2 2\a\ 8|a| 5 / 2 



(a > 0) 
(a < 0) . 



(7.160) 
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7.6.1 Cubic terms in Landau theory : first order transitions 



Next, consider a free energy with a cubic term, 



f = fo + ham 2 — §?/TO 3 + \bm A 



(7.161) 



with b > for stability. Without loss of generality, we may assume y > (else send m — ► — to). Note that we no 
longer have m — > — to (z'.e. Z 2 ) symmetry. The cubic term favors positive to. What is the phase diagram in the (a, y) 
plane? 



Extremizing the free energy with respect to m, we obtain 

Of 



dm 



= am — ym + 6m 



(7.162) 



This cubic equation factorizes into a linear and quadratic piece, and hence may be solved simply. The three 
solutions are m = and 



- y _l ( y Y a 



(7.163) 



We now see that for y 2 < 4ab there is only one real solution, at to = 0, while for y 2 > 4ab there are three real 



solutions. Which solution has lowest free energy? To find out, we compare the energy /(0) with /(r 
we set 

/(m) = /(0) =>• \am 2 - \yn? + \bm^ = , 
and we now have two quadratic equations to solve simultaneously: 

= a — ym + bm 2 

= ha — hym + \bm 2 = . 



\17 



Thus, 



(7.164) 



(7.165) 



Eliminating the quadratic term gives m 



Thus, we have the following: 



a > 



-ib 

2y 2 



3a/ y. Finally, substituting \ 



y 2 = \ab 



1 real root m = 



gives us a relation between a, b, and 

(7.166) 



- — > a > : 3 real roots; minimum at m = 

4& 9b 



(7.167) 



9& 



> a 



3 real roots; minimum at m = — 

2b 



2b 



a 
T, 



The solution m = lies at a local minimum of the free energy for a > and at a local maximum for a < 0. Over 

2 2 i 2 

the range > a > then, there is a global minimum at m = 0, a local minimum at m = m + , and a local 

maximum at m = m_, with m + > m_ > 0. For 4fj- > a > 0, there is a local minimum at a = 0, a global minimum 
at m = m + , and a local maximum at to = m_, again with to + > m_ > 0. For a < 0, there is a local maximum at 
to = 0, a local minimum at to = m_, and a global minimum at m = to + , with m + > > m_. See fig. 7.15. 



17 We needn't waste our time considering the m = m_ solution, since the cubic term prefers positive m. 
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Figure 7.15: Behavior of the quartic free energy /(to) = ^am 2 — ^ym 3 + \bm A . A: y 2 < 4ab ; B: 4a6 < y 2 < |afe 
; C and D: y 2 > |a&. The thick black line denotes a line of first order transitions, where the order parameter is 
discontinuous across the transition. 



7.6.2 Magnetization dynamics 

Suppose we now impose some dynamics on the system, of the simple relaxational type 



dm = r d f 

dt dm 



(7.168) 



where r is a phenomenological kinetic coefficient. Assuming y > and b > 0, it is convenient to adimensionalize 
by writing 



TO = — • U 

b 



Then we obtain 



a = — ■ r 
b 



du d(p 
ds du ' 



t= Ty^- S 



where the dimensionless free energy function is 

ip(u) = \ru 2 - \u z + \u 4 . 



(7.169) 
(7.170) 

(7.171) 



We see that there is a single control parameter, r. The fixed points of the dynamics are then the stationary points 
of f{u), where (f'(u) = 0, with 

<p'(u) =u{r-u + u 2 ) . (7.172) 



The solutions to ip' (u) = are then given by 

u* = 



u* = i± A M-r. 



(7.173) 



For r > | there is one fixed point at u = 0, which is attractive under the dynamics u = —ip'(u) since (p"{0) = r. At 
r = j there occurs a saddle-node bifurcation and a pair of fixed points is generated, one stable and one unstable. 
As we see from fig. 7.14, the interior fixed point is always unstable and the two exterior fixed points are always 
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Figure 7.16: Fixed points for ip(u) — ^ru 2 — ^u 3 + ju 4 and flow under the dynamics u = —(p'(u). Solid curves 
represent stable fixed points and dashed curves unstable fixed points. Magenta arrows show behavior under 
slowly increasing control parameter r and dark blue arrows show behavior under slowly decreasing r. For u > 
there is a hysteresis loop. The thick black curve shows the equilibrium thermodynamic value of u(r), i.e. that value 
which minimizes the free energy ip(u). There is a first order phase transition at r = |, where the thermodynamic 
value of u jumps from u = to u = |. 



stable. At r = there is a transcritical bifurcation where two fixed points of opposite stability collide and bounce 
off one another (metaphorically speaking). 

At the saddle-node bifurcation, r = | and u = |, and we find ip(u = |; r = j) = which is positive. Thus, 
the thermodynamic state of the system remains at u = until the value of (fi(u + ) crosses zero. This occurs when 
<p(u) = and ip'(u) = 0, the simultaneous solution of which yields r = | and u = |. 

Suppose we slowly ramp the control parameter r up and down as a function of the dimensionless time s. Under 
the dynamics of eqn. 7.170, u(s) flows to the first stable fixed point encountered - this is always the case for 
a dynamical system with a one-dimensional phase space. Then as r is further varied, u follows the position of 
whatever locally stable fixed point it initially encountered. Thus, u(r(s)) evolves smoothly until a bifurcation is 
encountered. The situation is depicted by the arrows in fig. 7.16. The equilibrium thermodynamic value for u(r) is 
discontinuous; there is a first order phase transition at r = |, as we've already seen. As r is increased, u(r) follows 
a trajectory indicated by the magenta arrows. For an negative initial value of u, the evolution as a function of r 
will be reversible. However, if u(0) is initially positive, then the system exhibits hysteresis, as shown. Starting with 
a large positive value of r, u(s) quickly evolves to u = + , which means a positive infinitesimal value. Then as r 
is decreased, the system remains at u = + even through the first order transition, because u = is an attractive 
fixed point. However, once r begins to go negative, the u = fixed point becomes repulsive, and u(s) quickly 

flows to the stable fixed point u + = ^ + J\ — r. Further decreasing r, the system remains on this branch. If r is 
later increased, then u(s) remains on the upper branch past r = 0, until the u + fixed point annihilates with the 

unstable fixed point at u_ = ^ — -J j — r, at which time u(s) quickly flows down to u = + again. 



326 



CHAPTER 7. MEAN FIELD THEORY OF PHASE TRANSITIONS 



7.6.3 Sixth order Landau theory : tricritical point 

Finally, consider a model with Z 2 symmetry, with the Landau free energy 

/ = f + \am 2 + \bm i + ±cto 6 , (7.174) 
with c > for stability. We seek the phase diagram in the (a, b) plane. Extremizing / with respect to m, we obtain 

^- = = m (a + bm 2 + cm 4 ) , (7.175) 
dm 

which is a quintic with five solutions over the complex m plane. One solution is obviously m = 0. The other four 
are 



m = ±i 



b / / b \ 2 a 



\ 2c V \2c 



±\l- --. (7.176) 



c 



For each ± symbol in the above equation, there are two options, hence four roots in all. 

If a > and b > 0, then four of the roots are imaginary and there is a unique minimum at to = 0. 

For a < 0, there are only three solutions to f'(m) = for real m, since the — choice for the ± sign under the radical 
leads to imaginary roots. One of the solutions is m = 0. The other two are 



The most interesting situation is a > and b < 0. If a > and b < — 2 v / ac, all five roots are real. There must be 
three minima, separated by two local maxima. Clearly if m* is a solution, then so is — m*. Thus, the only question 
is whether the outer minima are of lower energy than the minimum at m = 0. We assess this by demanding 
/(to*) = /(0), where m* is the position of the largest root (i.e. the rightmost minimum). This gives a second 
quadratic equation, 

= \a + {bm 2 + \cm 4 , (7.178) 

which together with equation 7.175 gives 

6=--^V5H- (7-179) 

Thus, we have the following, for fixed a > 0: 

b > —2y/ac : 1 real root to = 
-2 Vac > b > — Vac : 5 real roots; minimum at to = (7.180) 



■-j= \fac > b : 5 real roots; minima at to = ±y — ^- + y 



c 



The point (a, 6) = (0, 0), which lies at the confluence of a first order line and a second order line, is known as a 
tricritical point. 

7.6.4 Hysteresis for the sextic potential 



Once again, we consider the dissipative dynamics to = 



— r /'(to). We adimensionalize by writing 
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Figure 7.17: Behavior of the sextic free energy f(m) = ^am 2 + jbm 4 + ^cm 6 . A: a > and b > ; B: a < and 
b > ; C: a < and b < ; D: a > and b < —^^Jac ; E: a > and — -j^-Jac < b < —2^Jac ; F: a > and 
< b < 0. The thick dashed line is a line of second order transitions, which meets the thick solid line of first 
order transitions at the tricritical point, (a, b) = (0, 0). 



Then we obtain once again the dimensionless equation 



du dip 
ds du 



(7.182) 



where 

<p(u) = \ru 2 ± iu 4 + iw 6 . (7.183) 

In the above equation, the coefficient of the quartic term is positive if b > and negative if b < 0. That is, the 
coefficient is sgn(6). When b > we can ignore the sextic term for sufficiently small u, and we recover the quartic 
free energy studied earlier. There is then a second order transition at r = 0. . 

New and interesting behavior occurs for b > 0. The fixed points of the dynamics are obtained by setting tp'(u) = 0. 
We have 



(p(u) = \ru 2 - \u 4 + iu 6 
<p'(u) = u(r — u 2 + u 4 ) . 



(7.184) 
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Thus, the equation tp' (u) = factorizes into a linear factor u and a quartic factor u 4 — u 2 + r which is quadratic in 
u 2 . Thus, we can easily obtain the roots: 




(7.185) 



r > \ : u* = . 



In fig. 7.19, we plot the fixed points and the hysteresis loops for this system. At r = \, there are two symmetrically 
located saddle-node bifurcations at u = ±^|- We find ip(u = ±-^=,r = |) = which is positive, indicating 
that the stable fixed point u* = remains the thermodynamic minimum for the free energy if(u) as r is decreased 
through r = j. Setting <p(u) = and f'{u) = simultaneously, we obtain r = and u = ±-^. The thermody- 
namic value for u therefore jumps discontinuously from u = to u = (either branch) at r = j^; this is a first 
order transition. 

Under the dissipative dynamics considered here, the system exhibits hysteresis, as indicated in the figure, where 
the arrows show the evolution of u(s) for very slowly varying r(s). When the control parameter r is large and 
positive, the flow is toward the sole fixed point atu* = 0. Atr = |, two simultaneous saddle-node bifurcations 
take place at u* = ±-^; the outer branch is stable and the inner branch unstable in both cases. At r = there is a 
subcritical pitchfork bifurcation, and the fixed point at u* = becomes unstable. 

Suppose one starts off with r ^ | with some value u > 0. The flow ii — —(p'(u) then rapidly results in u — > + . 
This is the 'high temperature phase' in which there is no magnetization. Now let r increase slowly, using s as 
the dimensionless time variable. The scaled magnetization u(s) = u* (r(s)) will remain pinned at the fixed point 
u* = + . As r passes through r = j, two new stable values of u* appear, but our system remains at u = + , since 
u* = is a stable fixed point. But after the subcritical pitchfork, u* = becomes unstable. The magnetization u(s) 

then flows rapidly to the stable fixed point at u* = and follows the curve u* (r) = (| + (| — r) 1 / 2 ) ^ 2 for all 
r < 0. 
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Figure 7.19: Fixed points <p'(u*) = for the sextic potential tp (u) = \ru 2 — ju 4 + ^u 6 , and corresponding dynamical 
flow (arrows) under it = —tp'(u). Solid curves show stable fixed points and dashed curves show unstable fixed 
points. The thick solid black and solid grey curves indicate the equilibrium thermodynamic values for u; note the 
overall u — > —u symmetry. Within the region r € [0, 4] the dynamics are irreversible and the system exhibits the 
phenomenon of hysteresis. There is a first order phase transition at r = j^. 



Now suppose we start increasing r {i.e. increasing temperature). The magnetization follows the stable fixed point 

u*(r) = (-| + (-j — r) 1 / 2 ) 1//2 past r = 0, beyond the first order phase transition point at r = j^, and all the way up 
to r = \, at which point this fixed point is annihilated at a saddle-node bifurcation. The flow then rapidly takes 
u — ► u* = + , where it remains as r continues to be increased further. 

Within the region r £ [0, 4] of control parameter space, the dynamics are said to be irreversible and the behavior 
of u(s) is said to be hysteretic. 



7.7 Mean Field Theory of Fluctuations 

7.7.1 Correlation and response in mean field theory 

Consider the Ising model, 

k = - i E J u °i a i - E H k °k > (7.186) 

i,j k 

where the local magnetic field on site k is now H k . We assume without loss of generality that the diagonal terms 
vanish: J u = 0. Now consider the partition function Z = Tr e~^ H as a function of the temperature T and the local 
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field values {H^}. We have 



Thus, 



dZ 



d 2 z 



X 



dF 

dm. 



/?Tr 
/? 2 Tr 



a a e 



13 Z ■ (a,) 



-fiH 



(7.187) 



d 2 F 



8H i dH j 



(7.188) 



Expressions such as (e^), {a i of), etc. are in general called correlation functions. For example, we define the spin-spin 
correlation function C i} as 

Expressions such as J^- and gff-§jj are called response functions. The above relation between correlation functions 

and response functions, = k^TX^, is valid only for the equilibrium distribution. In particular, this relationship 
is invalid if one uses an approximate distribution, such as the variational density matrix formalism of mean field 
theory. 

The question then arises: within mean field theory, which is more accurate: correlation functions or response 
functions? A simple argument suggests that the response functions are more accurate representations of the real 
physics. To see this, let's write the variational density matrix g™ as the sum of the exact equilibrium (Boltzmann) 
distribution g cq = Z^ 1 cxp(—f)H) plus a deviation Sg: 



Then if we calculate a correlator using the variational distribution, we have 

( <7 i a ]) va r = Tr 

= Tr 



Tr 



(7.190) 



(7.191) 



Thus, the variational density matrix gets the correlator right to first order in 6g. On the other hand, the free energy 
is given by 



F v 



F cq + Y^ 



dF 



d 2 F 



8q„ 5g a , + 



(7.192) 



Here er denotes a state of the system, i.e. \<j) = \ a 1 , . . . , a N ), where every spin polarization is specified. Since the 
free energy is an extremum (and in fact an absolute minimum) with respect to the distribution, the second term 
on the RHS vanishes. This means that the free energy is accurate to second order in the deviation Sg. 



7.7.2 Calculation of the response functions 



Consider the variational density matrix 

i 



(7.193) 
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where 

^ i )=(^)^+(^)^- 1 - (7-194) 
The variational energy E = Tr (g H) is 

E=-±j2 J iJ m i m j-J2 H i m i ( 7 - 195 ) 



and the entropy 5 = — k B T Tr (g In g) is 



^ Infill + ( ^ Inf ^ ] > . (7.196) 



Setting the variation = 0, with F = E-TS, we obtain the mean field equations, 

m, = tanh (/3J tJ m j + (3H % ) , (7.197) 

where we use the summation convention: J i} rrij = J i} m^-. Suppose T > T c and m 4 is small. Then we can 
expand the RHS of the above mean field equations, obtaining 

(6 l3 - (3J l3 ) Tji j = pH t . (7.198) 

Thus, the susceptibility tensor X is the inverse of the matrix (k B T ■ I — J) : 

(/IT) 1 

^■ = ^ = (^-I-J):.\ (7.199) 

where I is the identity. Note also that so-called connected averages of the kind in eqn. 7.189 vanish identically if we 
compute them using our variational density matrix, since all the sites are independent, hence 

fa a,) = Tr ( e ™ a, a Q ) = Tr ( ft tr,) • Tr ( 0j a,-) - fa) • fa) , (7.200) 

and therefore = if we compute the correlation functions themselves from the variational density matrix, 
rather than from the free energy F. As we have argued above, the latter approximation is more accurate. 

Assuming J i} - = J(R i — Rj), where R { is a Bravais lattice site, we can Fourier transform the above equation, 
resulting in 

MQ) = - ^% . = %) H{q) . (7.201) 
Once again, our definition of lattice Fourier transform of a function <f>(R) is 



R 



d d q lf _^ R ^ 



R)=^k^ 



(27T) 

n 

where O is the unit cell in real space, called the Wigner-Seitz cell, and f2 is the first Brillouin zone, which is the unit 
cell in reciprocal space. Similarly, we have 



j(q) = Y^J(R)^--iq-R-^(Q-Rf + - 

R (7.203) 

= J(0)-{l-q 2 Rl+O(q 4 )} , 
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where 



R - = ^m- i7204) 



Here we have assumed inversion symmetry for the lattice, in which case 

R^R v J{R) = 2 ■ s>tu J2 r2j (R) • (7-205) 

R R 

On cubic lattices with nearest neighbor interactions only one has i?* = a/y/2d, where a is the lattice constant and 
d is the dimension of space. 

Thus, with the identification k B T c = J(0), we have 

1 



k B {T-T c ) + k B T c Rl q 2 + 0(q*) 

(7.206) 



1 1 



fc B T c i?2 £-2 + g2 + 0(g4) 

where 

T _ T \-V2 



t = R*- J (7.207) 

is the correlation length. With the definition 

£(T) oc \T-T C \~ V (7.208) 

as T — > T c , we obtain the mean field correlation length exponent v = i. The exact result for the two-dimensional 
Ising model is v = 1, whereas ^ w 0.6 for the d = 3 Ising model. Note that X(q = 0, T) diverges as (T — T c ) _1 for 
T>T C . 

In real space, we have 
where 

X « =n J-0yiXW eig<Ri ~ Rj) - (7 ' 210) 

Note that X(q) is properly periodic under q — > q + G, where G is a reciprocal lattice vector, which satisfies 
e iG R _ ^ £ or an y jj rec t Bravais lattice vector i?. Indeed, we have 



X \q) = k B T-j{q) 



k B T- J^Te^" 5 , 
5 



(7.211) 



where <5 is a nearest neighbor separation vector, and where in the second line we have assumed nearest neighbor 
interactions only. On cubic lattices in d dimensions, there are 2d nearest neighbor separation vectors, S = ±a e. , 
where \i G {1, . . . , d}. The real space susceptibility is then 

j d9 1 J d0 d e'"i e i ■■■e*"^ 

X( ] ~ J 2tt " ' J 2tt k B T - (2 J cos 6 1 + . . . + 2 J cos 0J ' ( ] 

— 7T — 7T 

where i? = a ^„ =1 rc^ is a general direct lattice vector for the cubic Bravais lattice in d dimensions, and the 
{n^} are integers. 
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The long distance behavior was discussed in chapter 6 (see §6.4.9 on Ornstein-Zernike theory 18 ). For convenience 
we reiterate those results: 



• In d = 1, 



In d > 1, with r — > oo and £ fixed, 



X*=i(*) = ( ^4rr^ ) e" N/S • (7.213) 



2k B r c m 



where the C d are dimensionless constants. 

In d > 2, with £ — > oo and r fixed (i.e. T — > T c at fixed separation r), 



«"> = ^'^-{ 1+0 (w u - (7 " 215) 



In d = 2 dimensions we obtain 



W^-^-lnf^e-^-jl + of-l-IL (7.216) 



where the C' d are dimensionless constants. 



7.8 Global Symmetries 

7.8.1 Symmetries and symmetry groups 

Interacting systems can be broadly classified according to their global symmetry group. Consider the following five 
examples: 

Asing = - XI J i3 a * a 3 a i G 

i<j 

Hp-clock = - 2j J ij C0S ( " ) 6 {1,2, . . . ,p} 

i<j VP/ 



ff 3 _Pott 8 = -E J y^^ *i €{l,2,...,q] (7.217) 

ffr,, s = -V^ J ■ & ■ O f2 G 

0(n) / , a.7 « j 2 ^- ^ 

The Ising Hamiltonian is left invariant by the global symmetry group Z 2 , which has two elements, I and 77, with 

V<7i = -Vi- (7-218) 



18 There is a sign difference between the particle susceptibility defined in chapter 6 and the spin susceptibility defined here. The origin of the 
difference is that the single particle potential v as defined was repulsive for v > 0, meaning the local density response Sn should be negative, 
while in the current discussion a positive magnetic field H prefers m > 0. 
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Figure 7.20: A domain wall in a one-dimensional Ising model. 

I is the identity, and rj 2 = I. By simultaneously reversing all the spins a i — > — a it the interactions remain invariant. 

The degrees of freedom of the p-state clock model are integer variables n i each of which ranges from 1 to p. The 
Hamiltonian is invariant under the discrete group Z , whose p elements are generated by the single operation rj, 
where 

n, + 1 if G {l,2,...,p-l} 
1 if n i = p . 

Think of a clock with one hand and p 'hour ' markings consecutively spaced by an angle 2n/p. In each site i, a hand 
points to one of the p hour marks; this determines n i . The operation rj simply advances all the hours by one tick, 
with hour p advancing to hour 1, just as 23:00 military time is followed one hour later by 00:00. The interaction 
cos (2n(n i — rij)/p) is invariant under such an operation. The p elements of the group Z p are then 

l, V ,rj 2 , ... ^p- 1 . (7.220) 




We've already met up with the g-state Potts model, where each site supports a 'spin' a i which can be in any of q 
possible states, which we may label by integers {1 , ... , q}. The energy of two interacting sites i and j is —J^ if 
a i = (ij and zero otherwise. This energy function is invariant under global operations of the symmetric group on q 
characters, S q , which is the group of permutations of the sequence {1,2,3,..., g}. The group S q has ql elements. 
Note the difference between a Z q symmetry and an S q symmetry. In the former case, the Hamiltonian is invariant 
only under the (/-element cyclic permutations, e.g. 



1 2 ■ ■ • q-lq 
23-- q 1 



and its powers rf with I = 0, . . . , q — 1. 



All these models - the Ising, p-state clock, and g-state Potts models - possess a global symmetry group which is 
discrete. That is, each of the symmetry groups Z 2 , Z p , S q is a discrete group, with a finite number of elements. The 

XY Hamiltonian H X y on the other hand is invariant under a continuous group of transformations <p i — > 4> i + a, 
where 4> i is the angle variable on site i. More to the point, we could write the interaction term cos(<^ i — ■) as 
\{z*Zj + z^*), where z i = e l< ^ is a phase which lives on the unit circle, and z* is the complex conjugate of 
z,. The model is then invariant under the global transformation z i — > e la z i . The phases e la form a group under 
multiplication, called U ( 1 ) , which is the same as ( 2 ) . Equivalently, we could write the interaction asfi^fi^, where 

f2 i = (cos^j , sin^j), which explains the 0(2), symmetry, since the symmetry operations are global rotations in 
the plane, which is to say the two-dimensional orthogonal group. This last representation generalizes nicely to 
unit vectors in n dimensions, where 

f2 = {Q 1 , Q 2 , ... , Q n ) (7.221) 



with ft 2 = 1 . The dot product tl i ■ fl- is then invariant under global rotations in this n-dimensional space, which 
is the group 0(n). 
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Figure 7.21: Domain walls in the two-dimensional (left) and three-dimensional (right) Ising model. 
7.8.2 Lower critical dimension 



Depending on whether the global symmetry group of a model is discrete or continuous, there exists a lower critical 
dimension d e at or below which no phase transition may take place at finite temperature. That is, for d < d e , the 
critical temperature is T c = 0. Owing to its neglect of fluctuations, mean field theory generally overestimates the 
value of T c because it overestimates the stability of the ordered phase. Indeed, there are many examples where 
mean field theory predicts a finite T c when the actual critical temperature is T c = 0. This happens whenever 
d < d g . 

Let's test the stability of the ordered (ferromagnetic) state of the one-dimensional Ising model at low temperatures. 
We consider order-destroying domain wall excitations which interpolate between regions of degenerate, symmetry- 
related ordered phase, i.e. and J, J, J, J, J,. For a system with a discrete symmetry at low temperatures, the 
domain wall is abrupt, on the scale of a single lattice spacing. If the exchange energy is J, then the energy of a 
single domain wall is 2 J, since a link of energy — J is replaced with one of energy + J. However, there are ./V 
possible locations for the domain wall, hence its entropy is k B In N. For a system with M domain walls, the free 
energy is 



F = 2MJ -kJT\n 



= N ■{ 2Jx + k B T xlnx+(l - x)ln(l - 



(7.222) 



where x = M/N is the density of domain walls, and where we have used Stirling's approximation for fc! when k 
is large. Extremizing with respect to x, we find 

1 



1 



= e -2J/k B T 



3 2J/k B T _|_ ^ 



(7.223) 



The average distance between domain walls is x 1 , which is finite for finite T. Thus, the thermodynamic state of 
the system is disordered, with no net average magnetization. 

Consider next an Ising domain wall in d dimensions. Let the linear dimension of the system be L ■ a, where L is 
a real number and a is the lattice constant. Then the energy of a single domain wall which partitions the entire 
system is 2,7 ■ L d ~ l . The domain wall entropy is difficult to compute, because the wall can fluctuate significantly, 
but for a single domain wall we have S > k B In L. Thus, the free energy F = 2JL d ~ 1 — k B T In L is dominated by 
the energy term if d > 1, suggesting that the system may be ordered. We can do a slightly better job in d = 2 by 
writing 



exp 



-2PJ/k B T 



(7.224) 
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where the sum is over all closd loops of perimeter P, and N P is the number of such loops. An example of such a 
loop circumscribing a domain is depicted in the left panel of fig. 7.21. It turns out that 



N P ~ k p 



P- ■ {l + OiP- 1 )} , (7.225) 



where K = z — 1 with z the lattice coordination number, and 6 is some exponent. We can understand the k p factor 
in the following way. At each step along the perimeter of the loop, there are n = z — 1 possible directions to go 
(since one doesn't backtrack). The fact that the loop must avoid overlapping itself and must return to its original 
position to be closed leads to the power law term P~ e , which is subleading since k p P~ 8 = cxp(PlriK — OlnP) 
and P > In P for P > 1. Thus, 

Fn-±L d J2P- e e {lnK - WJ)P , (7-226) 

' p 

which diverges if In k > 2/3 J, i.e. if T > 2J/k B \n(z — 1). We identify this singularity with the phase transition. The 
high temperature phase involves a proliferation of such loops. The excluded volume effects between the loops, 
which we have not taken into account, then enter in an essential way so that the sum converges. Thus, we have 
the following picture: 

In k < 2/3 J : large loops suppressed ; ordered phase 
In k > 2/3 J : large loops proliferate ; disordered phase . 



On the square lattice, we obtain 



fc B T c approx = Y^ = L82J 
2 J 

k yexact = ± = 2.27 J . 

sinh-^l) 



The agreement is better than we should reasonably expect from such a crude argument. 

JVofa bene : Beware of arguments which allegedly prove the existence of an ordered phase. Generally speaking, 
any approximation will underestimate the entropy, and thus will overestimate the stability of the putative ordered 
phase. 



7.8.3 Continuous symmetries 

When the global symmetry group is continuous, the domain walls interpolate smoothly between ordered phases. 
The energy generally involves a stiffness term, 

E = \ Ps Jd d t{ye)\ (7.227) 

where 6{r) is the angle of a local rotation about a single axis and where p s is the spin stiffness. Of course, in 0(n) 
models, the rotations can be with respect to several different axes simultaneously. 

In the ordered phase, we have 0(r) = 9 , a constant. Now imagine a domain wall in which 6(r) rotates by 2ir 
across the width of the sample. We write 6(r) = lnnxjL, where L is the linear size of the sample (here with 
dimensions of length) and n is an integer telling us how many complete twists the order parameter field makes. 
The domain wall then resembles that in fig. 7.22. The gradient energy is 

L 2 

E = \ Ps L^jdx {^Pj = 2,W Ps . (7.228) 
o 
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Figure 7.22: A domain wall in an XY ferromagnet. 



Recall that in the case of discrete symmetry, the domain wall energy scaled as E oc L^ 1 . Thus, with S > k B \nL 
for a single wall, we see that the entropy term dominates if d < 2, in which case there is no finite temperature 
phase transition. Thus, the lower critical dimension d e depends on whether the global symmetry is discrete or 
continuous, with 

discrete global symmetry ==>• dg = 1 
continuous global symmetry d e — 2 . 

Note that all along we have assumed local, short-ranged interactions. Long-ranged interactions can enhance order 
and thereby suppress d g . 

Thus, we expect that for models with discrete symmetries, d e = 1 and there is no finite temperature phase transi- 
tion for d < 1. For models with continuous symmetries, d e = 2, and we expect T c = for d < 2. In this context 
we should emphasize that the two-dimensional XY model does exhibit a phase transition at finite temperature, 
called the Kosterlitz-Thouless transition. However, this phase transition is not associated with the breaking of the 
continuous global 0(2) symmetry and rather has to do with the unbinding of vortices and antivortices. So there 
is still no true long-ranged order below the critical temperature T KT , even though there is a phase transition! 



7.8.4 Random systems : Imry-Ma argument 

Oftentimes, particularly in condensed matter systems, intrinsic randomness exists due to quenched impurities, 
grain boundaries, immobile vacancies, etc. How does this quenched randomness affect a system's attempt to 
order at T = 0? This question was taken up in a beautiful and brief paper by J. Imry and S.-K. Ma, Phys. Rev. Lett. 
35, 1399 (1975). Imry and Ma considered models in which there are short-ranged interactions and a random local 
field coupling to the local order parameter: 

jJ2 4-^-E^^' ( 7 - 23 °) 

(ij) « 

where 

«i?f>> = , ((H?Hf)) = r5 a < } 6 ijl (7.231) 

where (( • )) denotes a configurational average over the disorder. Imry and Ma reasoned that a system could try 
to lower its free energy by forming domains in which the order parameter takes advantage of local fluctuations in 
the random field. The size of these domains is assumed to be L d , a length scale to be determined. See the sketch 
in the left panel of fig. 7.23. 



"RFI ~ 



RFO(n) 
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L d = (J/H rms ) *■ <0 



Figure 7.23: Left panel : Imry-Ma domains for an 0(2) model. The arrows point in the direction of the local order 
parameter field (f2(r)). Right panel : free energy density as a function of domain size L d . Keep in mind that the 
minimum possible value for L d is the lattice spacing a. 



There are two contributions to the energy of a given domain: bulk and surface terms. The bulk energy is 



bulk 



-H rms (L d /a) d / 2 



(7.232) 



where a is the lattice spacing. This is because when we add together (L d /a) d random fields, the magnitude of the 
result is proportional to the square root of the number of terms, i.e. to {L d /a) d / 2 . The quantity H rms = \/T is the 
root-mean-square fluctuation in the random field at a given site. The surface energy is 



E~ 



rface 



X 



J(L d /a) d (discrete symmetry) 
J (L d /a) d ~ 2 (continuous symmetry) 



(7.233) 



We compute the critical dimension d c by balancing the bulk and surface energies, 



d-1 



¥ 



d-2 = \d 



d c = 2 
d =4 



(discrete) 
(continuous) 



The total free energy is F = (V/ L d ) ■ AE, where AE = E hulk + E surl . Thus, the free energy per unit cell is 

F 



f 



V/a d 



(7.234) 



If d < d c , the surface term dominates for small L d and the bulk term dominates for large L d There is global 
minimum at 



L, 



d H 



J 



(7.235) 



For d > d c , the relative dominance of the bulk and surface terms is reversed, and there is a global maximum at 
this value of L d . 

Sketches of the free energy f(L d ) in both cases are provided in the right panel of fig. 7.23. We must keep in mind 
that the domain size L d cannot become smaller than the lattice spacing a. Hence we should draw a vertical line on 
the graph at L d = a and discard the portion L d < a as unphysical. For d < d c , we see that the state with L d = oo, 
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i.e. the ordered state, is never the state of lowest free energy. In dimensions d < d c , the ordered state is always unstable 
to domain formation in the presence of a random field. 

For d > d c , there are two possibilities, depending on the relative size of J and H Tms . We can see this by evaluating 
f(L d = a) = J — H lms and f(L d = oo) = 0. Thus, if J > H lms , the minimum energy state occurs for L d = oo. In 
this case, the system has an ordered ground state, and we expect a finite temperature transition to a disordered 
state at some critical temperature T c > 0. If, on the other hand, J < H rms , then the fluctuations in H overwhelm 
the exchange energy at T = 0, and the ground state is disordered down to the very smallest length scale (i.e. the 
lattice spacing a). 

Please read the essay, "Memories of Shang-Keng Ma," at http : //sip . clarku . edu/skma . html. 



7.9 Ginzburg-Landau Theory 

7.9.1 Ginzburg-Landau free energy 

Including gradient terms in the free energy, we write 

F[m(x), h(x)} = Jd d x l^f + ±am 2 + \bm 4 + ±cm 6 - ft m + ±k (Vm) 2 + . . . j . (7.236) 

In principle, any term which does not violate the appropriate global symmetry will turn up in such an expansion 
of the free energy, with some coefficient. Examples include hrv? (both m and h are odd under time reversal), 
to 2 (V?7i) 2 , etc. We now ask: what function m(x) extremizes the free energy functional F\m(x) , h(x)]? The 
answer is that m(x) must satisfy the corresponding Euler-Lagrange equation, which for the above functional is 

a m + bm 3 + cm 5 - h - K V 2 to = . (7.237) 

If a > and h is small (we assume b > and c > 0), we may neglect the m 3 and m 5 terms and write 

(a-nV 2 )m = h, (7.238) 

whose solution is obtained by Fourier transform as 

m(9) = , (7-239) 

which, with h(x) appropriately defined, recapitulates the result in eqn. 7.201. Thus, we conclude that 

^ - ^ • < 7 - 240 > 

which should be compared with eqn. 7.206. For continuous functions, we have 

m(q) = fd d xm(x) e~ iq x (7.241) 



We can then derive the result 
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where 

1 [ d d q e iq < x - x ' 
where the correlation length is £ = y/n/a <x (T - T c )-^ 2 , as before. 

If a < then there is a spontaneous magnetization and we write m(x) = m Q + 5m(x). Assuming h is weak, we 
then have two equations 

a + bml + c ml = (7.245) 

(a + ibml + 5cm 4 , - re V 2 ) 5m = h . (7.246) 

If —a > is small, we have m 2 , = —a/36 and 

8m{q) = n h(q) 2 , (7.247) 



-2a + Kq 2 



7.9.2 Domain wall profile 



A particularly interesting application of Ginzburg-Landau theory is its application toward modeling the spatial 
profile of defects such as vortices and domain walls. Consider, for example, the case of Ising (Z 2 ) symmetry with 
h = 0. We expand the free energy density to order m 4 : 

F[m(x)] = Jd d x |/ + \am 2 + ±6m 4 + i K (Vm) 2 | . (7.248) 

We assume a < 0, corresponding to T < T c . Consider now a domain wall, where mix — ► — oo) = — m and 
mix — ► +oo) = +m , where m Q is the equilibrium magnetization, which we obtain from the Euler-Lagrange 
equation, 

am + bm 3 -n V 2 m = , (7.249) 



assuming a uniform solution where Vm = 0. This gives m = yj\a\/b. It is useful to scale m,(x) by m , writing 
mix) = m Q 4>{x). The scaled order parameter function 4>(x) interpolates between <p(— oo) = —1 and 0(+oo) = 1. 

1 /2 

It also proves useful to rescale position, writing x = (2/s/|a|) ' C- Th en we obtain 

\ V 2 cj) = -<f> + cj) 3 . (7.250) 
We assume (f>iC) = </>(C) is only a function of one coordinate, C = C 1 - Then the Euler-Lagrange equation becomes 

where 

C/(0) = -i(0 2 -l) 2 . (7.252) 

The 'potential' U((f>) is an inverted double well, with maxima at <j) = ±1. The equation tj> = —U'(<f>), where dot 
denotes differentiation with respect to (, is simply Newton's second law with time replaced by space. In order 
to have a stationary solution at £ — > ±oo where <f> = ±1, the total energy must be E = U(4> = ±1) = 0, where 
E = i<^ 2 + U(4>). This leads to the first order differential equation 

^ = l-0 2 , (7.253) 
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with solution 

4>{() = tanh(C) . (7.254) 

Restoring the dimensionful constants, 

m(x) = m tanh.^ — > (7.255) 

1 /2 

where the coherence length £ = («/|a|) diverges at the Ising transition a = 0. 
7.9.3 Derivation of Ginzburg-Landau free energy 

We can make some progress in systematically deriving the Ginzburg-Landau free energy. Consider the Ising 
model, 

= - \ E K a °i a 3 - E h i °i + \ E K « > ( 7 - 256 ) 



KT 



where now = J^Jk s T and h i = H i /k B T are the interaction energies and local magnetic fields in units of 
k-gT. The last term on the RHS above cancels out any contribution from diagonal elements of K^. Our derivation 
makes use of a generalization of the Gaussian integral, 

00 1/2 

Jdxe- l * ax2 - bx =(j?) e" 2 / 2a . (7.257) 

— OO 

The generalization is 

OO oc 



lx N e-hA^i-h^ = { 2 *} N/ ] esV^i , 



-oo — OO 



where we use the Einstein convention of summing over repeated indices, and where we assume that the matrix A 
is positive definite (else the integral diverges). This allows us to write 



Z = e"^» Tr 



1 6—1 



N e i^-^i Tr e Wi+ h *)^ 



dct- 1/2 (2nK) e ~^ K " Jd^-'-Ja 

— OO — OO 
OO OO 

det _1/2 (27riY ) e~^ K » Jd^-- J d<j) N e 

— OO — OO 

D OO 

#!•■•/ d<t> N e -*Wi'-'*") , (7.259) 



-oo — OO 



where 



* = 5 E A V ^ 0j ~ E ln cosh (^ + K) + I ln det ( 27rA + I Tr K - N ln 2 • (7.260) 
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We assume the model is defined on a Bravais lattice, in which case we can write 4> i = <f> R . We can then define the 
Fourier transforms, 



j^X^e** (7.261) 

m^tR**-** ( 7 - 262 ) 



U R 



N R 
and 

K{q) = J2 K ( R ) e ~ iq R ■ ( 7 - 263 ) 
R 

A few remarks about the lattice structure and periodic boundary conditions are in order. For a Bravais lattice, we 
can write each direct lattice vector R as a sum over d basis vectors with integer coefficients, viz. 

d 

^ = E ? v a M' ( 7 - 264 ) 

where d is the dimension of space. The reciprocal lattice vectors satisfy 

% • b u = 27r <W , (7.265) 

and any wavevector q may be expressed as 



(7.267) 



q 2, 

n=i 

We can impose periodic boundary conditions on a system of size M 1 x M 2 x • • • x M d by requiring 

This leads to the quantization of the wavevectors, which must then satisfy 

e iM » = e «V, = l , (7.268) 

and therefore 6^ = , where m M is an integer. There are then M 1 M 2 ■ ■ ■ M d = N independent values of q, 

which can be taken to be those corresponding to m„ G {1, . . . , M^}. 

Let's now expand the function $(cp) in powers of the <p i , and to first order in the external fields h { . We obtain 

$ = 3 E -l)W 2 + nE^-E ft * ^ + O(0 6 , ft 2 ) (7.269) 

+ | Tr /v + i Tr ln(27r/v ) - N In 2 

On a d-dimensional lattice, for a model with nearest neighbor interactions K l only, we have K (q) = K l e tq S , 
where 6 is a nearest neighbor separation vector. These are the eigenvalues of the matrix K^. We note that is 
then not positive definite, since there are negative eigenvalues 19 . To fix this, we can add a term K Q everywhere 
along the diagonal. We then have 

K(q) = K + K x ]T cos(g ■ S) . (7.270) 
S 



19 To evoke a negative eigenvalue on a c£-dimensional cubic lattice, set q = — for all fi. The eigenvalue is then —2dK 1 . 
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Here we have used the inversion symmetry of the Bravais lattice to eliminate the imaginary term. The eigenvalues 
are all positive so long as K Q > zK x , where z is the lattice coordination number. We can therefore write K(q) = 
K(0) — aq 2 for small q, with a > 0. Thus, we can write 

K- l (q)-l = a + Kq 2 + ... . (7.271) 

To lowest order in q the RHS is isotropic if the lattice has cubic symmetry but anisotropy will enter in higher 
order terms. We'll assume isotropy at this level. This is not necessary but it makes the discussion somewhat less 
involved. We can now write down our Ginzburg-Landau free energy density: 

J" = a0 2 + i K |V0| 2 + ^0 4 -/i0, (7.272) 

valid to lowest nontrivial order in derivatives, and to sixth order in <f>. 

One might wonder what we have gained over the inhomogeneous variational density matrix treatment, where 
we found 



F = -sE J M Ms)l 2 - E Mi) 

Q Q 



9 x / x / x / N -i (7.273) 

1 + m 4 \ , / 1 + to,- \ / 1 — 771,- \ , / 1 — to,- 



Surely we could expand J(q) = j(0) — ^aq 2 + . . . and obtain a similar expression for T. However, such a derivation 
using the variational density matrix is only approximate. The method outlined in this section is exact. 

Let's return to our complete expression for $: 

R 

where 

^E G_1 (9) l^)l 2 + i Tr (tttft) + ^ Tr 111 (ttzfO " Nhx2 ■ {7275) 



Here we have defined 



v(ej)) = \<p 2 — In cosh 



(7.276) 



G ^ = T^Sk ■ < 7 - 277) 

1 - K(q) 

Z= m^ e -* o( ^ e-^R^s) (7.278) 



and 



We now want to compute 



where 

D$ = #j # 2 • • • . (7.279) 
We expand the second exponential factor in a Taylor series, allowing us to write 

z = z o (i - E < u ^h)) + IEE u (<M> + ■■■)' (7 - 280) 



344 



CHAPTER 7. MEAN FIELD THEORY OF PHASE TRANSITIONS 



where 



In Z = i Tr 



ln(l + G) - 



G 



1 + G 



TV In 2 



and 



JD$F 



,-*n 



To evaluate the various terms in the expansion of eqn. 7.280, we invoke Wick's theorem, which says 



oo oo 



oo oo 



' rvt rv* . . . T* 

i Ju ■ JLj ■ Ju ■ 



) = / dx 1 ■ ■ ■ f dx 



J N 



: y - XX- 



X. X- ■ ■ ■ X. 



Jdx 1 - J dx N e a s i/ x i x j 



-oo — oo 



— oo — oo 



V g. .g. . ■■■g. . , 



all distinct 
pairings 



where the sets {j u . . . , j 2L } are all permutations of the set {z l5 . . . , z 2L }. In particular, we have 

<4> = 3(a ii ) 2 . 

In our case, we have 

Thus, if we write u(<^>) ~ yij <^ 4 and retain only the quartic term in u(0), we obtain 



= — In Z = i Tr 



G 



1 + G 



- ln(l + G) 



+ — (Tr G) 2 - AT In 2 



-7Vln2 + i L(TrG) 2 -iTr(G 2 ) + 0(G 3 ) 



(7.281) 



(7.282) 



(7.283) 



(7.284) 
(7.285) 



(7.286) 



Note that if we set to be diagonal, then K(q) and hence G{q) are constant functions of q. The C(G 2 ) term 
then vanishes, which is required since the free energy cannot depend on the diagonal elements of . 



7.9 A Ginzburg criterion 

Let us define A(T, H, V, N) to be the usual (i.e. thermodynamic) Helmholtz free energy. Then 

= [Dme-W™™ , (7.287) 



e 



where the functional F[m[x)] is of the Ginzburg-Landau form, given in eqn. 7.248. The integral above is a 
functional integral. We can give it a more precise meaning by defining its measure in the case of periodic functions 
m(x) confined to a rectangular box. Then we can expand 

m{x) = -U y m„ e iq x , (7.288) 
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and we define the measure 

Dm = dm J^J d Re m q d Im m q . (7.289) 



9 



Note that the fact that m(x) € K means that m_ q = m* q . We'll assume T > T c and H = and we'll explore limit 
T — ► T+ from above to analyze the properties of the critical region close to T c . In this limit we can ignore all but 
the quadratic terms in m, and we have 



= jDm exp (- ±(3^2(a + K q 2 ) \m q \ 2 



=n 

Q 

Thus, 



Q 



a + nq 2 



1/2 (7.290) 



We now assume that a(T) = at, where t is the dimensionless quantity 



T - T 

t = , (7.292) 



known as the reduced temperature. 



We now compute the heat capacity C v — — T |^4. We are really only interested in the singular contributions to 
C v , which means that we're only interested in differentiating with respect to T as it appears in a(T). We divide 
by N s k B where N s is the number of unit cells of our system, which we presume is a lattice-based model. Note 
N s ~ V/a d where V is the volume and a the lattice constant. The dimensionless heat capacity per lattice site is 
then 

c= Cy = ^ A fJ*l 1 

" iV s 2k 2 J {2n) d (r 2 + q 2 ) 2 ' { 

where £ = (n/at) 1 / 2 oc |t| -1 ^ 2 is the correlation length, and where A ~ a -1 is an ultraviolet cutoff. We define 
= (n/a) 1 / 2 , in which case 

(const, if d > 4 
-mi ifd = 4 (7.295) 



where q=q£. Thus, 



[t^ 2 ifd<4 



For d > 4, mean field theory is qualitatively accurate, with finite corrections. In dimensions d < 4, the mean 
field result is overwhelmed by fluctuation contributions as t — > + (i.e. as T — > T" c + ). We see that MFT is sensible 
provided the fluctuation contributions are small, i.e. provided 

R~ 4 a d ^~ d < 1 , (7.296) 



which entails t > t G , where 
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is the Ginzburg reduced temperature. The criterion for the sufficiency of mean field theory, namely t » t G , is known 
as the Ginzburg criterion. The region |t| < t G is known as the critical region. 

In a lattice ferromagnet, as we have seen, ~ a is on the scale of the lattice spacing itself, hence t G ~ 1 and 
the critical regime is very large. Mean field theory then fails quickly as T — > T c . In a (conventional) three- 
dimensional superconductor, i?^ is on the order of the Cooper pair size, and R^/a ~ 10 2 — 10 3 , hence t G = 
(a/R^) 6 ~ 10~ 18 — 10~ 12 is negligibly narrow. The mean field theory of the superconducting transition - BCS 
theory - is then valid essentially all the way to T = T c . 



1-mV i 1 — m 
In 



7.10 Appendix I : Equivalence of the Mean Field Descriptions 

In both the variational density matrix and mean field Hamiltonian methods as applied to the Ising model, we 
obtained the same result m = tanh ((m + h)/9). What is perhaps not obvious is whether these theories are in fact 
the same, i.e. if their respective free energies agree. Indeed, the two free energy functions, 

f A (m, M) = -i rn 2 -hm + e{ H-^j In (l±^) + 
/ B (m, h, 9) = +| m 2 - 9 In f e +(™+h)/0 + e ~(m+h)/e\ ; (7.298) 

where / A is the variational density matrix result and / B is the mean field Hamiltonian result, clearly are different 
functions of their arguments. However, it turns out that upon minimizing with respect to m in each cast, the 
resulting free energies obey f A (h, 9) = f B (h, 9). This agreement may seem surprising. The first method utilizes an 
approximate (variational) density matrix applied to the exact Hamiltonian H. The second method approximates 
the Hamiltonian as H MF , but otherwise treats it exactly. The two Landau expansions seem hopelessly different: 

/ A (m, h, 9) = -9 In 2 - hm + | (6 - 1) m 2 + ^ m 4 + ^m 6 + ... (7.299) 
/,<„,, U, 0) _ -t 1.2 + K - M + ^ - M + . . . . (7300, 

We shall now prove that these two methods, the variational density matrix and the mean field approach, are in 
fact equivalent, and yield the same free energy f(h, 9). 

Let us generalize the Ising model and write 

H = -J2 Jij e(<Ti,<Tj) - ' ( 7301 ) 

i<j i 

Here, each 'spin' <7j may take on any of K possible values, {s 1) . . . , s K }. For the 5=1 Ising model, we would 
have K = 3 possibilities, with s 1 = —1, s 2 = 0, and s 3 = +1. But the set {s a }, with a E {1, . . . , K}, is completely 
arbitrary 20 . The 'local field' term Q(a) is also a completely arbitrary function. It may be linear, with $(cr) = Ha, 
for example, but it could also contain terms quadratic in a, or whatever one desires. 

The symmetric, dimensionless interaction function e(a, a') = e(a', a) is areal symmetric K x K matrix. According 
to the singular value decomposition theorem, any such matrix may be written in the form 

£ (a, a') = ^ A p A p (er) X p (a') , (7.302) 
P =i 

where the {A p } are coefficients (the singular values), and the {A p ((t)} are the singular vectors. The number of 
terms N s in this decomposition is such that iV s < K. This treatment can be generalized to account for continuous 
a. 

20 It needn't be an equally spaced sequence, for example. 
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7.10.1 Variational Density Matrix 

The most general single-site variational density matrix is written 

K 



(7.303) 



a = l 



Thus, x a is the probability for a given site to be in state a, with a = s a . The {x a } are the K variational parameters, 
subject to the single normalization constraint, J2 a x a — 1- We now have 



/ = { Tr ( Q H) + k B T Tr ( e In e ) 

= ~5 A P A p( S o) A p( S a') x a X a , ~ ^ <P( S a) %a + J2 x a inx a 



(7.304) 



where <p{d) = $(er)/ J(0). We extremize in the usual way, introducing a Lagrange undetermined multiplier £ to 
enforce the constraint. This means we extend the function /({x a }), writing 



f*{x 1 ,...,x K ,Q = f{x 1 ,...,x K ) + C, ^i a -l 



K 



(7.305) 



and freely extremizing with respect to the (K + 1) parameters {x 1: . . . ,x K , £}. This yields K nonlinear equations, 

df* 







<9x„ 



J2J2 a p vo Vv) v - ^( S a) + ^X a + c + 9 

p a' 



(7.306) 



for each a, and one linear equation, which is the normalization condition, 

df* 







(7.307) 



We cannot solve these nonlinear equations analytically, but they may be recast, by exponentiating them, as 



1 J 1 

X a = 77 CX P \ 7T 



A p(0 A p( S a') + 



with 



z = e (cm+i =x;«p(Meea. vo a ? (v) ^ + </>( 



p a' 



From the logarithm of x Q , we may compute the entropy, and, finally, the free energy: 

/(M) = ^E A P VO A p(«a') x a x a> ~0^Z , 



(7.308) 



(7.309) 



(7.310) 



p a, a' 



which is to be evaluated at the solution of 7.306, {x* (ft, 0)} 
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7.10.2 Mean Field Approximation 



We now derive a mean field approximation in the spirit of that used in the Ising model above. We write 

X p (a) = (X p (a)) + 6X p (a) , (7.311) 
and abbreviate A p = (A p (er)), the thermodynamic average of Ap(er) on any given site. We then have 



Ap(cr) Ap(cr') = Ap + A p SX p {a) + X p SX p (a') + 6X p (cr) 5X p {a') 
= -Ap + Ap (Ap(a) + Ap(a')) + SX p (a) 6X p (a') . 
der in fluctuations, and we neglect it. Thi 

h mf = +iivj(o) y, a p' x I-Y, E a p ~ x p A >o + 



(7.312) 

The product SX p (a) 6X p (cr') is of second order in fluctuations, and we neglect it. This leads us to the mean field 
Hamiltonian, 



The free energy is then 

/({Ap}, h, e) = | A p X l ~ 6 ln E ex P I \ \T, A p X p x p^) + 



(7.313) 



(7.314) 



The variational parameters are the mean field values {A p }. 
The single site probabilities {x a } are then 



Z 



exp ■ 



^2 A pX P X p (s a ) + ip(s a ) 



(7.315) 



with Z implied by the normalization ^ Q x a = 1. These results reproduce exactly what we found in eqn. 7.306, 
since the mean field equation here, df /dX p = 0, yields 



A' 



Ap — Ap(s a ) x a 



(7.316) 



The free energy is immediately found to be 



f(h,9) = ±J2 A P X l- einZ > ( 7 - 317 ) 

p 

which again agrees with what we found using the variational density matrix. 

Thus, whether one extremizes with respect to the set {x 1: . . . , x K , (}, or with respect to the set {A p }, the results 
are the same, in terms of all these parameters, as well as the free energy f(h, 9). Generically, both approaches may 
be termed 'mean field theory' since the variational density matrix corresponds to a mean field which acts on each 
site independently. 



7.11 Appendix II : Additional Examples 
7.11.1 Blume-Capel model 

The Blume-Capel model provides a simple and convenient way to model systems with vacancies. The simplest 
version of the model is written 

H = - \ E J v S i S > + A E S i ■ ( 7 - 318 ) 
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Figure 7.24: Mean field phase diagram for the Blume-Capel model. The black dot signifies a tricritical point, where 
the coefficients of m 2 and m 4 in the Landau free energy expansion both vanish. The dashed curve denotes a first 
order transition, and the solid curve a second order transition. The thin dotted line is the continuation of the 8 C (5) 
relation to zero temperature. 



The spin variables S { range over the values { — 1, 0, +1}, so this is an extension of the S = 1 Ising model. We 
explicitly separate out the diagonal terms, writing J u = 0, and placing them in the second term on the RHS 
above. We say that site i is occupied if S i = ±1 and vacant if S i = 0, and we identify — A as the vacancy creation 
energy which may be positive or negative, depending on whether vacancies are disfavored or favored in our 
system. 

We make the mean field Ansatz, writing S i = m + SS i . This results in the mean field Hamiltonian, 

H MF = | TV J(0) m 2 - J(0) m ]T S t + A ]T 5? . (7.319) 

i i 

Once again, we adimensionalize, writing / = F/NJ{0), 9 = k B T/ J(0), and 5 = A/ J(0). We assume J(0) > 0. The 
free energy per site is then 

f(6, 6, m) = \m 2 - 9 In (l + 2e~ s/f> cosh(m/6>)) . (7.320) 
Extremizing with respect to m, we obtain the mean field equation, 

2sinh(m/6») 

exp(5/9) + 2 cosh(m/9) ' K ' ' 

Note that m = is always a solution. Finding the slope of the RHS at m = and setting it to unity gives us the 
critical temperature: 

(7.322) 



c exp(<5/0 c ) + 2 ' 
This is an implicit equation for 9 C in terms of the vacancy energy 5. 

Let's now expand the free energy in terms of the magnetization m. We find, to fourth order, 



/ = -01 n (l + 2e-*/«) + 4(0- 



29 V 2 + exp(S/9) 
1 / 6 



m 2 



12(2 + expOV60)6i 3 V 2 + cx p(^) 



(7.323) 



1 )m 4 
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Note that setting the coefficient of the m 2 term to zero yields the equation for 9 C . However, upon further exami- 
nation, we see that the coefficient of the to 4 term can also vanish. As we have seen, when both the coefficients of 
the ra 2 and the m 4 terms vanish, we have a tricritical point 21 . Setting both coefficients to zero, we obtain 

9 t = \ , £ t = fln2. (7.324) 



At 9 = 0, it is easy to see we have a first order transition, simply by comparing the energies of the paramagnetic 
(S i = 0) and ferromagnetic (S^ = +1 or Si = -1) states. We have 

-^ = {° ifm = ° (7.325) 

iVJ(O) [|-A ifm = ±l. 

These results are in fact exact, and not only valid for the mean field theory. Mean field theory is approximate 
because it neglects fluctuations, but at zero temperature, there are no fluctuations to neglect! 

The phase diagram is shown in fig. 7.24. Note that for <5 large and negative, vacancies are strongly disfavored, 
hence the only allowed states on each site have S i = ±1, which is our old friend the two-state Ising model. 
Accordingly, the phase boundary there approaches the vertical line 9 C = 1, which is the mean field transition 
temperature for the two-state Ising model. 



7.11.2 Ising antiferromagnet in an external field 

Consider the following model: 

H = J^V^-if^V, , (7.326) 

with J > and a { — ±1. We've solved for the mean field phase diagram of the Ising ferromagnet; what happens 
if the interactions are antiferromagnetic? 

It turns out that under certain circumstances, the ferromagnet and the antiferromagnet behave exactly the same 
in terms of their phase diagram, response functions, etc. This occurs when H = 0, and when the interactions are 
between nearest neighbors on a bipartite lattice. A bipartite lattice is one which can be divided into two sublattices, 
which we call A and B, such that an A site has only B neighbors, and a B site has only A neighbors. The square, 
honeycomb, and body centered cubic (BCC) lattices are bipartite. The triangular and face centered cubic lattices 
are non-bipartite. Now if the lattice is bipartite and the interaction matrix J- is nonzero only when i and j are 
from different sublattices (they needn't be nearest neighbors only), then we can simply redefine the spin variables 
such that 

I J-rr if n C A 

(7.327) 

Then a^a'j = —cr i <Jy and in terms of the new spin variables the exchange constant has reversed. The thermody- 
namic properties are invariant under such a redefinition of the spin variables. 

We can see why this trick doesn't work in the presence of a magnetic field, because the field H would have to be 
reversed on the B sublattice. In other words, the thermodynamics of an Ising ferromagnet on a bipartite lattice 
in a uniform applied field is identical to that of the Ising antiferromagnet, with the same exchange constant (in 
magnitude), in the presence of a staggered field H A = +H and H B = —H. 




: We should really check that the coefficient of the sixth order term is positive, but that is left as an exercise to the eager student. 
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Figure 7.25: Graphical solution to the mean field equations for the Ising antiferromagnet in an external field, here 
for = 0.6. Clockwise from upper left: (a) h = 0.1, (b) h = 0.5, (c) h = 1.1, (d) h = 1.4. 



We treat this problem using the variational density matrix method, using two independent variational parameters 
m . and m„ for the two sublattices: 



1 + m A 1 - m A 

Qa{(t) = — 2 — ^.1 + — 2 — ■ 1 

fo(cr) = ^ (7,1 + o <V,-1 • 



(7.328) 



2 ^ 2 

With the usual adimensionalization, / = F/NzJ, 9 = k B T/zJ, and h = H/zJ, we have the free energy 

f(m A , m B ) = |m A m B - \ h (m A + m B ) - \B s(m A ) - ± s(m B ) , (7.329) 
where the entropy function is 



Note that 



s(m) = — 



ds 
dm 



1 + m , / 1 + m \ 1 — m / 1 — m 
— - — In — - — H — In 



1 + m 



m 



d 2 s 



dm 2 1 



(7.330) 



(7.331) 
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Figure 7.26: Mean field phase diagram for the Ising antiferromagnet in an external field. The phase diagram is 
symmetric under reflection in the h = axis. 



Differentiating f(m A , m B ) with respect to the variational parameters, we obtain two coupled mean field equations: 



df 
dm A 

df 

dm„ 



2 V 1 - m„ 



2 V 1 - m c 



(7.332) 



Recognizing tanh (a;) = \ In (1 + x)/(l — x) , we may write these equations in an equivalent but perhaps more 
suggestive form: 



= tanh 



h- 



h- 



In other words, the A sublattice sites see an internal field H 



tanh f — . (7.333) 
= — zJm B from their B neighbors, and the B 



sublattice sites see an internal field H„ 



-zJm A from their A neighbors. 



We can solve these equations graphically, as in fig. 7.25. Note that there is always a paramagnetic solution with 
m A = m B = to, where 

1 + to \ . I h — rrC 



to = h In , 

2 V 1 - to 



to = tanh 



(7.334) 



However, we can see from the figure that there will be three solutions to the mean field equations provided that 
< — 1 at the point of the solution where m A = to b = to. This gives us two equations with which to eliminate 
tci A and to b , resulting in the curve 



h * = m+ In 

2 VI — to 



with to = vl 



(7.335) 



Thus, for 9 < 1 and \h\ < h*{9) there are three solutions to the mean field equations. It is usually the case, the 
broken symmetry solutions, which mean those for which m A ^ m B in our case, are of lower energy than the 
symmetric solution(s). We show the curve h* (6) in fig. 7.26. 



We can make additional progress by defining the average and staggered magnetizations to and m s , 



to = h(m A + m B ) 



i(TO A -TO B ) 



(7.336) 
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We expand the free energy in terms of m s : 

f(m, m s ) = im 2 — \m\ — hm — \9 s(m + m a ) — | 6 s(m — m s ) 



2 



m 



hm — 



9 s(m) - \ (l + 9 s" (m)) m 2 s -±9 s""{m) m 4 s + 



(7.337) 



The term quadratic in m s vanishes when 9 s"(m) = — 1, i.e. when m = \/l — 8. It is easy to obtain 

d 3 s 2m d 4 s 2(1 + 3m 2 ) 

dm 3 (1 — to 2 ) 2 dm A (1 — m 2 ) 3 ' 



(7.338) 



from which we learn that the coefficient of the quartic term, ~^9 s"" (m), never vanishes. Therefore the transition 
remains second order down to 9 = 0, where it finally becomes first order. 

We can confirm the 9 — > limit directly. The two competing states are the ferromagnet, with m A ~ m B = ±1, and 
the antiferromagnet, with m A = — m B = ±1. The free energies of these states are 

/ FM = i-/i , / AFM = -I. (7.339) 

There is a first order transition when f™ = f A ™, which yields h = 1. 



7.11.3 Canted quantum antiferromagnet 

Consider the following model for quantum S = | spins: 

# = E [ - J (°f°i + a i a j) + A + l K E °< °i^°f . ( 7 - 34 °) 

<«) WW) 

where cr- is the vector of Pauli matrices on site i. The spins live on a square lattice. The second sum is over all 
square plaquettes. All the constants J, A, and K are positive. 

Let's take a look at the Hamiltonian for a moment. The J term clearly wants the spins to align ferromagnetically 
in the [x, y) plane (in internal spin space). The A term prefers antiferromagnetic alignment along the z axis. The 
K term discourages any kind of moment along z and works against the A term. We'd like our mean field theory 
to capture the physics behind this competition. 

Accordingly, we break up the square lattice into two interpenetrating V2 x V2 square sublattices (each rotated by 
45° with respect to the original), in order to be able to describe an antiferromagnetic state. In addition, we include 
a parameter a which describes the canting angle that the spins on these sublattices make with respect to the A-axis. 
That is, we write 



q a = | + im ( sin aa x + cos a a z ) 



q b = i + |m (sin a a x — cos a a z ) . (7.341) 



Note that Tr g A = Tr g B = 1 so these density matrices are normalized. Note also that the mean direction for a spin 
on the A and B sublattices is given by 

m A B = Tr (g A B cr) = ± m cos a z + m sin a x . (7.342) 

Thus, when a = 0, the system is an antiferromagnet with its staggered moment lying along the z axis. When 
a = 7J7T, the system is a ferromagnet with its moment lying along the x axis. 
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Finally, the eigenvalues of g AB are still A ± = |(1 ± to), hence 

s (to) ee - Tr (g A In g A ) = - Tr (g B In g B ) 



1 + TO , / 1 + TO 

in 



1 — TO / 1 — TO 

In 



(7.343) 



Note that we have taken to a = m B = m, unlike the case of the antiferromagnet in a uniform field. The reason is 
that there remains in our model a symmetry between A and B sublattices. 



The free energy is now easily calculated: 



F = Tr (gH) + k B T Tr (g In g) 



-2N(j sin 2 a + A cos 2 a) to 2 + ^NXm 4 cos 4 a - Nk B T s{m) 



(7.344) 



We can adimensionalize by defining S = A/ J, k = K/AJ, and = k B T/AJ. Then the free energy per site is 

/ = F/ANJ is 

/(to, a) = ~\m 2 + |(l - S) m 2 cos 2 a + iKTO 4 cos 4 a - 6 s{m) . (7.345) 
There are two variational parameters: to and 6. We thus obtain two coupled mean field equations, 



^J- = = —to + (l — 6) m cos 2 a + Km 3 cos 4 a + \& lnf — — — 
dm \ 1 — to 

df ( 2 2 \ 2 

■7— = 0= 1 — <5 + k to cos a to sin a cos a . 

oa V / 



(7.346) 
(7.347) 



Let's start with the second of the mean field equations. Assuming to 7^ 0, it is clear from eqn. 7.345 that 

'0 if 6 < 1 

cos 2 a = ^ (5 - 1)/kto 2 if 1 < S < 1 + kto 2 

1 if (5 > 1 + k to 2 . 

Suppose 5 < 1. Then we have cos a = and the first mean field equation yields the familiar result 

to = tanh (771/0) . 

Along the axis, then, we have the usual ferromagnet-paramagnet transition at C = 1. 



For 1 < 5 < 1 



we have canting with an angle 



*/ x -1 [S^l 

a = a (to) = cos \ 

V K Tfl 



(7.348) 



(7.349) 



(7.350) 

Substituting this into the first mean field equation, we once again obtain the relation to = tanh (to/0). However, 
eventually, as is increased, the magnetization will dip below the value too = \J(6 — 1)/k. This occurs at a 
dimensionless temperature 

0o= "1" 7 <1 : m o = J^- (7-351) 
tann (ttIq) V k 
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Figure 7.27: Mean field phase diagram for the model of eqn. 7.340 for the case k = 1. 



For 8 > 9 , we have 5 > 1 + urn 2 , and we must take cos 2 a = 1. The first mean field equation then becomes 

5m - «m 3 = - In ( l±^.\ (7.352) 
2 y 1 — m J 

or, equivalently, m = tanh ((<5m — urn?)/ 9). A simple graphical analysis shows that a nontrivial solution exists 
provided 9 < S. Since cos a = ±1, this solution describes an antiferromagnet, with m A = ±mz and m B = fm£ 
The resulting mean field phase diagram is then as depicted in fig. 7.27. 



7.11.4 Coupled order parameters 

Consider the Landau free energy 

- ^ " m T 4 <J m in TjU^ T|l/^ "T 2 



/(m, 0) - I a m m 2 + i 6 m m 4 + I a. 2 + \ b 4 4 + \A m 2 </> 2 . (7.353) 



We write 

a rn = a m °m > a = a ^ 7 (7.354) 

where 



T - r - T - T '\ (7.355) 



c,m 



r ^ t 



where T is some temperature scale. We assume without loss of generality that T c m > T c , . We begin by rescaling: 



( t^) 1 fn , 4> = 1 <b . (7.356) 



We then have 

f = e \r (Rn ™ 2 + i ™ 4 ) + r- 1 (f 9^ 2 + \ 4 ) + § Am 2 2 \ , (7.357) 



where 



b, \ 1/2 . ,4 



(&m^) 1/2 ^ VW 0m^) 1/2 
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It proves convenient to perform one last rescaling, writing 

m = r- 1/4 m , = r 1/4 <p . (7.359) 

Then 



/ = £o{ h m m 2 + > 4 + ig- 1 # p 2 + \ V A + \ A m vj 



(7.360) 



where 



1/2 / h ^ 



q=s/7-= p -2. . (7.361) 



Note that we may write 

/(m, = f ( m 2 , 2 ) (I }) (£) + | K ^) (/\) • (7362) 

The eigenvalues of the above 2x2 matrix are 1 ± A, with corresponding eigenvectors ( ±1 ) • Since p 2 > 0, we are 
only interested in the first eigenvector (J), corresponding to the eigenvalue 1 + A. Clearly when A < 1 the free 
energy is unbounded from below, which is unphysical. 

We now set 

|^ = , |^ = 0, (7.363) 
dm dip 

and identify four possible phases: 

• Phase I : m = 0, (p = 0. The free energy is f l = 0. 

• Phase II : m 7^ with tp = 0. The free energy is 

/= |(^ m m 2 + im 4 ) , (7.364) 
hence we require 9 m < in this phase, in which case 



n = V Z *0Z , fu = - E 7Q 2 2 m . (7.365) 



Phase III : m = with tp ^ 0. The free energy is 



2 \* "4>r < 2 

hence we require 9 , < in this phase, in which case 



f = ^(q- 1 0,p 2 + ^^) 1 (7.366) 



<Pm = \[ Z <r l h > /ni = -j?" 2 ^- (7.367) 



Phase IV : m 7^ and ip ^ 0. Varying / yields 



1 A\ /m 2 \ / q 



A 1 J \p 2 ~ \q~ 1 < 



(7.368) 
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with solution 



A 2 - 1 



m 2 = , m g (7 369) 



o 9 1 9,h — 1 S m A 

^ 2 = y H rn {? ^ 

A z — 1 

Since m 2 and <ys 2 must each be nonnegative, phase IV exists only over a yet-to-be-determined subset of the 
entire parameter space. The free energy is 

q 2 e 2 m + q - 2 e 2 - 2A e m e 6 

fry = 4 (A 2-i) • (7 - 371) 

We now define 8 = 8 m and r = 8^ - 8 m = (T c m - T c ^/T . Note that r > 0. There are three possible temperature 
ranges to consider. 

(1) #0 > 8 m > 0. The only possible phases are I and IV. For phase IV, we must impose the conditions m 2 > 
and <fi 2 > 0. If A 2 > 1, then the numerators in eqns. 7.369 and 7.370 must each be positive: 

A < — — , A < -y— => A < mm — — , . (7.372) 

But since either q 2 9 m /9^ or its inverse must be less than or equal to unity this requires A < — 1, which is 
unphysical. 

If on the other hand we assume A 2 < 1, the non-negativeness of m 2 and <p 2 requires 

r0 m V e <h Q 



A > > A > A > max *-a , ] > 1 . (7.373) 



Thus, A > 1 and we have a contradiction. 

Therefore, the only allowed phase for 8 > is phase I. 

(2) 8<p > > 8 m . Now the possible phases are I, II, and IV. We can immediately rule out phase I because f u < f r 
To compare phases II and IV, we compute 

{q\8 m -q- 1 8,) 2 

A/ = / IV - f u = 4 " A2 _ 1} +' ■ (7.374) 

Thus, phase II has the lower energy if A 2 > 1. For A 2 < 1, phase IV has the lower energy, but the conditions 
m 2 > and p 2 > then entail 

^ L<X< 4t- ? 2 KJ>^> 0. (7.375) 



Thus, A is restricted to the range 



A € 



1 , 



? 2 K 



(7.376) 



With 8 m = 8 < and 8 < = 9 + r > 0, the condition q 2 \8 m \ > 9, is found to be 



-r<8< . (7377) 
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Figure 7.28: Phase diagram for r = 0.5, r = 1.5 (top) and r = 0.5, r = 0.25 (bottom). The hatched purple region is 
unphysical, with a free energy unbounded from below. The blue lines denote second order transitions. The thick 
red line separating phases II and III is a first order line. 



Thus, phase IV exists and has lower energy when 



-t < 9 < T — and - 1 < A < ^ , (7.378) 

r + 1 r9 



where r = q 2 . 



(3) > 9^ > 8 m . In this regime, any phase is possible, however once again phase I can be ruled out since phases 
II and III are of lower free energy. The condition that phase II have lower free energy than phase III is 

fu - /m = "-f (<T 2 ^ - Q 2 €) < , (7.379) 

i.e. \9^\ < r\0 m \, which means r\0\ > \8\ — r. If r > 1 this is true for all 9 < 0, while if r < 1 phase II is lower 
in energy only for \9\ < r/(l — r). 

We next need to test whether phase IV has an even lower energy than the lower of phases II and III. We have 

(qX9 m -q- 1 9 A ) 2 
/iv-/n= jTvTTi) ( 7 - 38 °) 

(q8 m -q- 1 \9,) 2 

/iv - fm = m 4(A2 _ 1} " • (7-381) 
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In both cases, phase IV can only be the true thermodynamic phase if A 2 < 1. We then require m 2 > and 
ip 2 > 0, which fixes 

Xe -l,mm(£p2 (7.382) 

The upper limit will be the first term inside the rounded brackets if g 2 |# m | < 6^, i.e. if r\6\ < \6\ — r. This is 
impossible if r > 1, hence the upper limit is given by the second term in the rounded brackets: 



r > 1 : A e 

If r < 1, then the upper limit will be q 2 9 r 
i£\6\ <r/(l-r). 



1 . 



6 



r0 



(condition for phase IV) 



(7.383) 



r6/{6+T) if |0| > T/(l-r), and will be 6>,/y6> m = {6 + T)/r6 



r < 1 , - 



1 -r 



<9<-t : A e 



< 1 , < 



1 -r 



A g 



- 1 



- 1 



r9 
rO 



(phase IV) 
(phase IV) 



(7.384) 
(7.385) 



Representative phase diagrams for the cases r > 1 and r < 1 are shown in fig. 7.28. 
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Chapter 8 

Nonequilibrium Phenomena 



8.1 References 

- H. Smith and H. H. Jensen, Transport Phenomena (Oxford, 1989) 

An outstanding, thorough, and pellucid presentation of the theory of Boltzmann transport in classical and 
quantum systems. 

- P. L. Krapivsky, S. Redner, and E. Ben-Nairn, A Kinetic View of Statistical Physics (Cambridge, 2010) 
Superb, modern discussion of a broad variety of issues and models in nonequilibrium statistical physics. 

- E. M. Lifshitz and L. P. Pitaevskii, Physical Kinetics (Pergamon, 1981) 

Volume 10 in the famous Landau and Lifshitz Course of Theoretical Physics. Surprisingly readable, and 
with many applications (some advanced). 

- M. Kardar, Statistical Physics of Particles (Cambridge, 2007) 

A superb modern text, with many insightful presentations of key concepts. Includes a very instructive 
derivation of the Boltzmann equation starting from the BBGKY hierarchy. 

- J. A. McLennan, Introduction to Non-equilibrium Statistical Mechanics (Prentice-Hall, 1989) 
Though narrow in scope, this book is a good resource on the Boltzmann equation. 

- F. Reif, Fundamentals of Statistical and Thermal Physics (McGraw-Hill, 1987) 

This has been perhaps the most popular undergraduate text since it first appeared in 1967, and with good 
reason. The later chapters discuss transport phenomena at an undergraduate level. 

- N. G. Van Kampen, Stochastic Processes in Physics and Chemistry (3 rd edition, North-Holland, 2007) 
This is a very readable and useful text. A relaxed but meaty presentation. 
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8.2 Equilibrium, Nonequilibrium and Local Equilibrium 

Classical equilibrium statistical mechanics is described by the full TV -body distribution, 

' z n 1 ■ ^e-^(P.aO OCE 
f°(x 1 ,...,x N ;p 1 ,...,p N ) = { (8.1) 



We assume a Hamiltonian of the form 



Pi 

typically with v = 0, i.e. only two-body interactions. The quantity 



N 2 N N 



r(x 1) ...,x N ;p 1 ,...,p N ) — d fjT^ (8 - 3) 

is the probability under equilibrium conditions, of finding N particles in the system, with particle #1 lying within 
d 3 x 1 of x 1 and having momentum within d d p 1 of p x , etc. The temperature T and chemical potential /i are constants, 
independent of position. Note that /({cCj}, {p^}) is dimensionless. 

Nonequilibrium statistical mechanics seeks to describe thermodynamic systems which are out of equilibrium, 
meaning that the distribution function is not given by the Boltzmann distribution above. For a general nonequilib- 
rium setting, it is hopeless to make progress - we'd have to integrate the equations of motion for all the constituent 
particles. However, typically we are concerned with situations where external forces or constraints are imposed 
over some macroscopic scale. Examples would include the imposition of a voltage drop across a metal, or a tem- 
perature differential across any thermodynamic sample. In such cases, scattering at microscopic length and time 
scales described by the mean free path I and the collision time t work to establish local equilibrium throughout the 
system. A local equilibrium is a state described by a space and time varying temperature T(r, t) and chemical 
potential fi(r, t). As we will see, the Boltzmann distribution with T = T(r, t) and fx = fi(r, t) will not be a solution 
to the evolution equation governing the distribution function. Rather, the distribution for systems slightly out of 
equilibrium will be of the form f = f° + Sf, where f° describes a state of local equilibrium. 

We will mainly be interested in the one-body distribution 

N 

f(r,p;t) = y £(S(x i (t) r)5( Pi (t) p) ) 

1=1 (8.4) 
= N j'[\d d x l d d p l f(r,x 2 ,.. .,x N ;p,p 2 , ...,p N ;t). 



i=2 



In this chapter, we will drop the l/K normalization for phase space integration. Thus, f(r,p, t) has dimensions of 
h~ d , and f(r,p, t) d\ d^p is the average number of particles found within d^r of r and d 3 p of p at time t. 

In the GCE, we sum the RHS above over N. Assuming v = so that there is no one-body potential to break 
translational symmetry, the equilibrium distribution is time-independent and space-independent: 

/°(r, p)=n (27rmfc B T)- 3 / 2 e~ p ' 1 / 2mk * T , (8.5) 



where n = N/V orn = n(T, fj) is the particle density in the OCE or GCE. From the one-body distribution we can 
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compute things like the particle current, j, and the energy current, j £ : 

j(r,t)= [d*pf(r,p;t)?- (8.6) 
J m 

j £ (r,t)= fd cl pf(r 7 p;t)e(p)^- , (8.7) 
J m 

where e(p) = p 2 /2m. Clearly these currents both vanish in equilibrium, when / = f°, since f°(r,p) depends 
only on p 2 and not on the direction of p. In a steady state nonequilibrium situation, the above quantities are 
time-independent. 

Thermodynamics says that 

dq = T ds = de — j_idn , (8.8) 

where s, e, and n are entropy density, energy density, and particle density, respectively, and dq is the differential 
heat density. This relation may be case as one among the corresponding current densities: 

j q = Tj s =j £ -vj ■ (8-9) 

Thus, in a system with no particle flow, j = and the heat current j q is the same as the energy current j e . 

When the individual particles are not point particles, they possess angular momentum as well as linear momen- 
tum. Following Lifshitz and Pitaevskii, we abbreviate T = (p,L) for these two variables for the case of diatomic 
molecules, and T = (p, L, h ■ L) in the case of spherical top molecules, where n is the symmetry axis of the top. 
We then have, in d = 3 dimensions, 

{d^p point particles 

d^p L dL df2 L diatomic molecules (8.10) 

d^p L 2 dL dfi L d cos ■& symmetric tops , 

where $ = cos _1 (n • L). We will call the set r the 'kinematic variables'. The instantaneous number density at r is 
then 

n(r,t) = Jdrf(r,r ; t). (8.11) 

One might ask why we do not also keep track of the angular orientation of the individual molecules. There are 
two reasons. First, the rotations of the molecules are generally extremely rapid, so we are justified in averaging 
over these motions. Second, the orientation of, say, a rotor does not enter into its energy. While the same can be 
said of the spatial position in the absence of external fields, (i) in the presence of external fields one must keep 
track of the position coordinate r since there is physical transport of particles from one region of space to another, 
and (iii) the collision process, which as we shall see enters the dynamics of the distribution function, takes place 
in real space. 



8.3 Boltzmann Transport Theory 

8.3.1 Derivation of the Boltzmann equation 

For simplicity of presentation, we assume point particles. Recall that 

. ,o I # of particles with positions within d 3 r of 

f(r : p,t)d 3 rd 3 p= { F F (8.12) 

I r and momenta within d p of p at time t. 
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We now ask how the distribution functions f{r,p, t) evolves in time. It is clear that in the absence of collisions, 
the distribution function must satisfy the continuity equation, 

^ + V-(w/) = 0. (8.13) 

This is just the condition of number conservation for particles. Take care to note that V and u are sz'x-dimensional 
phase space vectors: 

u = ( x , y , z , p x , p v , p z ) (8.14) 
d d d d d d 



dx' dy ' dz' dp x ' dp v ' dp z 



(8.15) 



The continuity equation describes a distribution in which each constituent particle evolves according to a pre- 
scribed dynamics, which for a mechanical system is specified by 

dr dH . . dp dH /f) „ x 

where F is an external applied force. Here, 

H(p,r)=e(p) + U eyLt {r). (8.17) 

For example, if the particles are under the influence of gravity, then U ex t{r) = mg ■ r and F = — Vf/ cxt = —mg. 

Note that as a consequence of the dynamics, we have V • u = 0, i.e. phase space flow is incompressible, provided 
that e(p) is a function of p alone, and not of r . Thus, in the absence of collisions, we have 

%+u-Vf = 0. (8.18) 
at 

The differential operator D t = dt + u ■ V is sometimes called the 'convective derivative', because D t f is the time 
derivative of / in a comoving frame of reference. 

Next we must consider the effect of collisions, which are not accounted for by the semiclassical dynamics. In a 
collision process, a particle with momentum p and one with momentum p can instantaneously convert into a pair 
with momenta p' and pi , provided total momentum is conserved: p + p = p' + p' . This means that D t f ^ 0. 
Rather, we should write 



Of _ df df fdf\ 
tt- + r ■ — + v- — = — 



dt dr dp \dt J 



+ *-lt+P-lt = llt (8-19) 



coll 



where the right side is known as the collision integral. The collision integral is in general a function of r, p, and t 
and afunctional of the distribution /. 



After a trivial rearrangement of terms, we can write the Boltzmann equation as 

(8.20) 



dt -\dt) stI + \dt) 



coll 



where 



dt J dr dp 



is known as the streaming term. Thus, there are two contributions to df /dt : streaming and collisions. 
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8.3.2 Collisionless Boltzmann equation 

In the absence of collisions, the Boltzmann equation is given by 



df de 9/_ Vf/ df 
&t dp dr oxt dp 



+ — VC/ cxt ■ -5Z = . (8.22) 



In order to gain some intuition about how the streaming term affects the evolution of the distribution f(r,p, t), 
consider a case where F cxt = 0. We then have 

df , p df 

Clearly, then, any function of the form 



0. (8.23) 
dt m dr 



f(r,p,t)=tp(r-v{p)t,p) (8.24) 

will be a solution to the collisionless Boltzmann equation, where v(p) = J|. One possible solution would be the 
Boltzmann distribution, 

f(r, p, t) = e ^/h B T e -p 2 /2rnk B T ^ ( g 25) 

which is time-independent 1 . Here we have assumed a ballistic dispersion, e(p) = p 2 /2m. 

For a slightly less trivial example, let the initial distribution be <p(r, p) = A e~ r I 2g e~ p I 2k , so that 

f(r,p,t) = Ae-^)' '~ /2 ° 2 e-v-l 2 ** . (8.26) 
Consider the one-dimensional version, and rescale position, momentum, and time so that 

f(x,p,t) = Ae-i^-P^ 2 e-^ . (8.27) 
Consider the level sets of /, where f(x, p, t) = A e~ 3 a . The equation for these sets is 

x = pt± yja 2 ~p 2 . (8.28) 

For fixed t, these level sets describe the loci in phase space of equal probability densities, with the probability 
density decreasing exponentially in the parameter a 2 . For t = 0, the initial distribution describes a Gaussian 
cloud of particles with a Gaussian momentum distribution. As i increases, the distribution widens in x but not 
in p - each particle moves with a constant momentum, so the set of momentum values never changes. However, 
the level sets in the (x , p) plane become elliptical, with a semimajor axis oriented at an angle 9 = ctn _1 (£) with 
respect to the x axis. For t > 0, he particles at the outer edges of the cloud are more likely to be moving away from 
the center. See the sketches in fig. 8.1 

Suppose we add in a constant external force -F oxt . Then it is easy to show (and left as an exercise to the reader to 
prove) that any function of the form 

f(r,p,t)=AJr-^ + ^f^,p-^) (8.29) 
\ m Ira m J 

satisfies the collisionless Boltzmann equation (ballistic dispersion assumed). 



1 Indeed, any arbitrary function of p alone would be a solution. Ultimately, we require some energy exchanging processes, such as collisions, 
in order for any initial nonequilibrium distribution to converge to the Boltzmann distribution. 
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Figure 8.1: Level sets for a sample f(x,p,t) = Ae~^ x ~^> e~^ p , for values / = Ae~^ a ~ with a. in equally spaced 
intervals from a = 0.2 (red) to a = 1.2 (blue). The time variable i is taken to be t = 0.0 (upper left), 0.2 (upper 
right), 0.8 (lower right), and 1.3 (lower left). 



8.3.3 Collisional invariants 

Consider a function A(r, p) of position and momentum. Its average value at time t is 

A{t) = J d 3 r d 3 P A(r,p) f(r,p,t). 

Taking the time derivative, 

-/**^'){-S-^-5-» fl + (iL} 

Hence, if A is preserved by the dynamics between collisions, then 2 

dA _ dA dr dA dp _ 
dt dr dt dp dt 



(8.30) 



(8.31) 



(8.32) 



2 Recall from classical mechanics the definition of the Poisson bracket, {A, B} = ■ ^ — ^ ■ ^ . Then from Hamilton's equations r = 0^ 



op 



and p = —^pf, where H(p, T, t) is the Hamiltonian, we have 4f- = {A H}. Invariants have zero Poisson bracket with the Hamiltonian. 
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We therefore have that the rate of change of A is determined wholly by the collision integral 

Quantities which are then conserved in the collisions satisfy A = 0. Such quantities are called collisional invariants. 
Examples of collisional invariants include the particle number (A = 1), the components of the total momentum 
(A = Pn) (i n the absence of broken translational invariance, due e.g. to the presence of walls), and the total energy 
(A = e(p)). 



8.3.4 Scattering processes 

What sort of processes contribute to the collision integral? There are two broad classes to consider. The first 
involves potential scattering, where a particle in state \T) scatters, in the presence of an external potential, to a 
state \r'). Recall that T is an abbreviation for the set of kinematic variables, e.g. T = (p, L) in the case of a diatomic 
molecule. For point particles, T = (p x ,p y ,p z ) and dr = d 3 p. 

We now define the function w(r'\r) such that 

w(r'\r) f(r, r ; t) dr dr> = j rate at which a particle within dr of (r ' r) (8.34) 

v 1 ' K ' | scatters to within dr' of (r,r') at time t. 

The units of w dr are therefore 1 /T. The differential scattering cross section for particle scattering is then 

da = W ( F lp dr' , (8.35) 
n \v\ 

where v = p/m is the particle's velocity and n the density. 

The second class is that of two-particle scattering processes, i.e. IIT^) — ► \rT{). We define the scattering function 

w (rT{ | rr x ) by 

{rate at which two particles within dr of (r, T) 
and within of (r , ^ ) scatter into states within (8.36) 
dr' of (r, r') and dr{ of (r, T{) at time t, 

where 

f 2 (r,p; r',p';t) = ( ^ 6( Xi (t) - r) 5{ Pl {t) - p) 5{ Xj (t) - r') 6( Pj (t) -p') ) (8.37) 
is the nonequilibrium two-particle distribution for point particles. The differential scattering cross section is 

w(r'ri\rr) , , 
da = — 11 , v dr' dr . 8.38) 

\v- Vl \ 

We assume, in both cases, that any scattering occurs locally, i.e. the particles attain their asymptotic kinematic states 
on distance scales small compared to the mean interparticle separation. In this case we can treat each scattering 
process independently. This assumption is particular to rarefied systems, i.e. gases, and is not appropriate for 
dense liquids. The two types of scattering processes are depicted in fig. 8.2. 
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In computing the collision integral for the state \r,T), we must take care to sum over contributions from transitions 
out of this state, i.e. \T) — > | r'), which reduce f(r, r), and transitions into this state, i.e. \r') — ► \r), which increase 
/(r, F). Thus, for one-body scattering, we have 



^f( r ,r ] t)=(j^j = fdr' {w(r\r')f(r,r'-t)-w(r'\r)f(r,r-t)]. (8.39) 

For two-body scattering, we have 



= jdr.Jdr'jdr', {wirr.ir'rl) f 2 (r,r';r,r{ ; t) ( 8 - 4 °) 

Unlike the one-body scattering case, the kinetic equation for two-body scattering does not close, since the LHS 

involves the one-body distribution / = f 1 and the RHS involves the two-body distribution / 2 . To close the 
equations, we make the approximation 

f 2 (r, r'; f , f ; f) « f(r, T; t) f(f, f; t) . (8.41) 

We then have 

^ /(r, r ; *) = fdrjdr'fdri {w^r, \ r'r[) f( r , r'- 1) f( r , r[-t) 

Dt J J J ( 8 .42) 

-wirTiirr^fir^-t)/^,^^)} . 



8.3.5 Detailed balance 

Classical mechanics places some restrictions on the form of the kernel w;(i _ T' 1 \rT[\. In particular, if _T T = 
(— p, —L) denotes the kinematic variables under time reversal, then 



w(r'r[ | m) = w (r T r? \ r' T r[ T ) 



(8.43) 
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This is because the time reverse of the process \rrA -> \rT[) is \r' T r[ T ) -> \r T T?). 
In equilibrium, we must have 

w(r'r[ | rn) /°(r) /°(ri) d*r = ^(r^rf | r' T r{ T ) /°(r' r ) / (rD d 4 r T (8.44) 

where 

d 4 r = rfr dTj dr'dr; , d A r T = dr T dr[ dr IT dr[ T . (8.45) 

Since dr = dT T etc., we may cancel the differentials above, and after invoking eqn. 8.43 and suppressing the 
common r label, we find 

f{r) fir,) = f(r' T ) f°(rn . (8.46) 

This is the condition of detailed balance. For the Boltzmann distribution, we have 

/°(r)=4e^B T , (8.47) 

where A is a constant and where e = e(T) is the kinetic energy, e.g. e(r) = p 2 /2m in the case of point particles. 
Note that s(r T ) = e(r). Detailed balance is satisfied because the kinematics of the collision requires energy 
conservation: 

e + e 1 =e' + e[ . (8.48) 
Since momentum is also kinematically conserved, i.e. 

P + Pi=p'+Pi, (8.49) 

any distribution of the form 

f°(r) = Ae- (£ - p - v V k * T (8.50) 

also satisfies detailed balance, for any velocity parameter V. This distribution is appropriate for gases which are 
flowing with average particle V. 

In addition to time-reversal, parity is also a symmetry of the microscopic mechanical laws. Under the parity 
operation P, we have r — > r and p — ► p. Note that a pseudovector such as L = r x p is unchanged under 
P. Thus, r F = (—p, L). Under the combined operation of C = PT, we have r° = (p, —L). If the microscopic 
Hamiltonian is invariant under C, then we must have 

w(r'r[ I r/\) = w(r°r? | r' c r[ c ) . (8.5i) 

For point particles, invariance under T and P then means 

w(p',Pi \ p,Pi) =w(p,p 1 \p',Pi) , (8.52) 
and therefore the collision integral takes the simplified form, 

Df(p) _ (df\ 
Dt \dtJ coU 

fd^Jd^Jd^ w(j/,p{ I P , Pl ) {f( P ') Hp',) - f( P ) f( Pl )} , 

where we have suppressed both r and t variables. 
The most general statement of detailed balance is 

/ o (r)/0(r{) ^ ^r'rurrj 

Under this condition, the collision term vanishes for / = f°, which is the equilibrium distribution. 



(8.53) 



(8.54) 
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8.3.6 Kinematics and cross section 

We can rewrite eqn. 8.53 in the form 



Dt 



jd 3 p 1 jdf2\v-v 1 \^ j {f(p')f(p[)-f(p)f(p 1 )} , (8.55) 



where ^ is the differential scattering cross section. If we recast the scattering problem in terms of center-of-mass 
and relative coordinates, we conclude that the total momentum is conserved by the collision, and furthermore that 
the energy in the CM frame is conserved, which means that the magnitude of the relative momentum is conserved. 
Thus, we may write p' — p\ = \p — p x \ !2, where J? is a unit vector. Then p' and p\ are determined to be 

p' =\{p + p x + \p- Pl \fi) 

p'i = W + Pi - \p-Pi\n) . 



8.3.7 H-theorem 

Let's consider the Boltzmann equation with two particle collisions. We define the local (i.e. r-dependent) quantity 

P<p (r, t) = Jdr tpir, /) f(r, r, t) . (8.57) 

At this point, tp(r, f) is arbitrary. Note that the <p(T, f) factor has r and t dependence through its dependence on 
/, which itself is a function of r, r, and t. We now compute 

9P V _ f dr d^pf) = r d(<pf) df 



dt J dt J df dt 



The first term on the last line follows from the divergence theorem, and vanishes if we assume / = for infinite 
values of the kinematic variables, which is the only physical possibility. Thus, the rate of change of p v is entirely 
due to the collision term. Thus, 



dt 



drjdrjdr'jdri {w(r'r[ \ rr x ) /.a x - wirr, \ r'r[) ff[ x ) 
= Jdr J drj, dr'J dr[ w (r'r[ \ rr, ) /a ( x -x'), 



(8.59) 



where / = f(r), f = f(r>), f l = /(A), f{ = f{r[), x = X(n with 

X = — =P + fgj. (8.60) 

We now invoke the symmetry 

w(r'r{ | rr t ) = w(r{ r' \r l r) , (8.6i) 

which allows us to write 

^ = I j drjdrjdr'jdri w{r'r[ \ rr,) ff, ( x + xi- x' - A) ■ (8.62) 
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This shows that p v is preserved by the collision term if x(r) is a collisional invariant. 
Now let us consider ip(f) = In/. We define h = p\ _ ln „. We then have 



ah 



\jdrj dr x J dT'J dr[ w ff[ -xlnx, (8.63) 
where w = w(rT{ \ rr x ) and x = ffi/f'f[. We next invoke the result 

Jdr'Jdr[ w(r'r{ \ r/\) = J dr 'J dr l ™{ rr i I r'r{) (8.64) 

which is a statement of unitarity of the scattering matrix 3 . Multiplying both sides by f(T) /(.Ti), then integrating 
over r and I\, and finally changing variables (r, i^) <-> (J 1 ', i"j), we find 

o = JdrJ dr,J dr'J dr{ w (/a - /'/{) = J dr J dr iJ dr 'f dr l w f'f'i (8-65) 

Multiplying this result by \ and adding it to the previous equation for h, we arrive at our final result, 

^ = -±Jdrj dr ± J dr'J dr[ w f'f[ (x In x - x + 1) . (8.66) 

Note that w, f, and f[ are all nonnegative. It is then easy to prove that the function g(x) ~ xlnx — x + 1 is 
nonnegative for all positive x values 4 , which therefore entails the important result 

2^<0. ,8.67) 
Boltzmann's H function is the space integral of the h density: H = J d?r h. 

Thus, everywhere in space, the function h(r,£) is monotonically decreasing or constant, due to collisions. In 
equilibrium, h = everywhere, which requires x = 1, i.e. 

f°(r)f(r 1 ) = f(r')f(r[), (8.68) 

or, taking the logarithm, 

ln/ (r)+W (A) = bxf(r')+\nf(ri) . (8.69) 

But this means that In f° is itself a collisional invariant, and if 1, p, and e are the only collisional invariants, then 
In f° must be expressible in terms of them. Thus, 

ln/° = -^- + ^--i-, (8.70) 

k^T k^T k^T 

where /j, V, and T are constants which parameterize the equilibrium distribution f°(p), corresponding to the 
chemical potential, flow velocity, and temperature, respectively. 



3 See Lifshitz and Pitaevskii, Physical Kinetics, §2. 

4 The function g(x) = x\nx — x + 1 satisfies g'(x) = lnx, hence g'(x) < on the interval x £ [0, 1) and g'(x) > on x S (1, oo]. Thus, 
g(x) monotonically decreases from 3(0) = 1 to g(l) = 0, and then monotonically increases to g(oo) = oo, never becoming negative. 
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8.4 Weakly Inhomogeneous Gas 



Consider a gas which is only weakly out of equilibrium. We follow the treatment in Lifshitz and Pitaevskii, §6. As 
the gas is only slightly out of equilibrium, we seek a solution to the Boltzmann equation of the form / = f + Sf, 
where f is describes a local equilibrium. Recall that such a distribution function is annihilated by the collision 
term in the Boltzmann equation but not by the streaming term, hence a correction Sf must be added in order to 
obtain a solution. 



The most general form of local equilibrium is described by the distribution 

/V,r) = Cexp(^g±^ 
where \i = [i(r, t), T = T(r, t), and V = V(r, t) vary in both space and time. Note that 

df = (dft+p- dV + ie-v-V-p)^- de^j (- 

= (^d P+ p.dV + (e-h)§-de >S j (-^ 
where we have assumed V = on average, and used 



T 



= — s dT H — dp , 
n 



(8.71) 



(8.72) 



(8.73) 



where s is the entropy per particle and n is the number density. We have further written h = fJ, + Ts, which is the 
enthalpy per particle. Here, c p is the heat capacity per particle at constant pressure 5 . Finally note that when f° is 
the Maxwell-Boltzmann distribution, we have 

df f° 

^ = It- (8 - 74) 



The Boltzmann equation is written 



dt m dr 



'■£)e° + *>-(ffL 



(8.75) 



The RHS of this equation must be of order Sf because the local equilibrium distribution f is annihilated by the 
collision integral. We therefore wish to evaluate one of the contributions to the LHS of this equation, 



df 
dt 



m dr dp 



df 
de 



1 dp 
n dt 



e-hdT 



T dt 



mv- 



(v-V)V 



dV 1 „ 

m -7— + - Vp 

dt n 



(8.76) 



T 



v-VT -Fv 



5 In the chapter on thermodynamics, we adopted a slightly different definition of c p as the heat capacity per mole. In this chapter c p is the 
heat capacity per particle. 
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To simplify this, first note that Newton's laws applied to an ideal fluid give pV = — Vp, where p = mn is the mass 
density. Corrections to this result, e.g. viscosity and nonlinearity in V, are of higher order. 

Next, continuity for particle number means h + V ■ (nV) = 0. We assume V is zero on average and that all 
derivatives are small, hence V • (riV) = V ■ Vn + nV-V«nV-V\ Thus, 

^ = ^-^ = -V.V, (8.77) 

ot ot ot 

where we have invoked the ideal gas law n = p/k B T above. 

Next, we invoke conservation of entropy. If s is the entropy per particle, then ns is the entropy per unit volume, 
in which case we have the continuity equation 

2M + V .( nsV ) = n (^ + v-Vs^+s(^ + V-(nV)\ =0. (8.78) 

The second bracketed term on the RHS vanishes because of particle continuity, leaving us with s + V- Vs « s = 
(since V = on average, and any gradient is first order in smallness). Now thermodynamics says 

T p 



c p dlnT dlnp 



Ot Cy 



Thus eqn. 8.76 becomes 



<9/° P df° „ df° ( df°\\e(r)-h 



where 



h - Tc„ - e(r) 

Cy/K B 



Udv dv 



Therefore, the Boltzmann equation takes the form 



T " " " c v /k B ' ' ~ "\k B T' dt \dtj c 



(8.79) 



since T(J£) p - c p and (§) T = where v = V/N. Thus, 

= . (8.80) 



k B dt dt 

We now have in eqns. 8.77 and 8.80 two equations in the two unknowns 9l g t T and dl Q t p , yielding 

^ = -^V-F (8.81) 

Ot Cy 

dlnp iv-V. (8.82) 



(8.83) 



. - VT + Cag - £<r) - » + TC * V V - F ■ 4 JL + - m . ,8.85) 
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Notice we have dropped the terms v ■ and F ■ since 5f must already be first order in smallness, and both 

the operator as well as F add a second order of smallness, which is negligible. Typically is nonzero if 
the applied force F(t) is time-dependent. We use the convention of summing over repeated indices. Note that 
Qufi = Qaa = V- V. For ideal gases in which only translational and rotational degrees of freedom are excited, 

h = c p T. 



8.5 Relaxation Time Approximation 

8.5.1 Approximation of collision integral 

We now consider a very simple model of the collision integral, 

%) -LJL-*. (8.86) 

9t /coll T T 

This model is known as the relaxation time approximation. Here, f° = f°(r,p, t) is a distribution function which 
describes a local equilibrium at each position r and time t. The quantity r is the relaxation time, which can in 
principle be momentum-dependent, but which we shall first consider to be constant. In the absence of streaming 
terms, we have 

^S = -~ => tf(r,p,t) = <y/(r,p,0)e-*/ T . (8.87) 
at t 

The distribution / then relaxes to the equilibrium distribution f° on a time scale r. We note that this approximation 
is obviously flawed in that all quantities - even the collisional invariants - relax to their equilibrium values on the 
scale t. In the Appendix, we consider a model for the collision integral in which the collisional invariants are all 
preserved, but everything else relaxes to local equilibrium at a single rate. 



8.5.2 Computation of the scattering time 

Consider two particles with velocities v and v' . The average of their relative speed is 

(|t; -«'|>= fd 3 vfd 3 v'P(v)P{v')\v-v'\, (8.88) 
where P(v) is the Maxwell velocity distribution, 

-3/2 

2itKT J V 2KT 



P(v)=[w^) exp(--f-), (8.89) 



which follows from the Boltzmann form of the equilibrium distribution /°(p). It is left as an exercise for the 
student to verify that 

4 fk T\ 1/2 

5b)1 =(|„-«'|> = _(^-) . (8.90) 
V7T V m J 

Note that v iel = \/2 v, where v is the average particle speed. Let a be the total scattering cross section, which for 
hard spheres is a = nd 2 , where d is the hard sphere diameter. Then the rate at which particles scatter is 

- = nv rcl a. (8.91) 
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< V ,T ► 

rel 

Figure 8.3: Graphic representation of the equation n a v xe \ T = 1, which yields the scattering time r in terms of the 
number density n, average particle pair relative velocity v IC \, and two-particle total scattering cross section a. The 
equation says that on average there must be one particle within the tube. 

The particle mean free path is simply 

i = vr = —^ — . (8.92) 

V2 n a 

While the scattering length is not temperature-dependent within this formalism, the scattering time is T-dependent, 
with 

1 v 7r / m x 



r(T) = —_ = j—[ — \ . (8.93) 

nv rcl a Ana \ k B l J 

As T — > 0, the collision time diverges as t oc T -1 / 2 , because the particles on average move more slowly at lower 
temperatures. The mean free path, however, is independent of T , and is given by I = 1/ \/2na. 



8.5.3 Thermal conductivity 

We consider a system with a temperature gradient VT and seek a steady state {i.e. time-independent) solution 
to the Boltzmann equation. We assume F a = Q a/3 = 0. Appealing to eqn. 8.85, and using the relaxation time 
approximation for the collision integral, we have 

t(s — c ri T) n 
5f = - \ T P 2 (v ■ VT) f° . (8.94) 

We are now ready to compute the energy and particle currents. In order to compute the local density of any quantity 
A(r,p), we multiply by the distribution f(r,p) and integrate over momentum: 

p A (r,t) = J d 3 p A(r, p)f(r, p, t) , (8.95) 

For the energy (thermal) current, we let A = ev a = e p a /m, in which case p A = j a . Note that Jd 3 ppf° = since f° 
is isotropic in p even when p and T depend on r. Thus, only 5f enters into the calculation of the various currents. 
Thus, the energy (thermal) current is 

j e »= fd 3 pev a Sf 

w (8.96) 



k B T 2 x v p ' 1 dx? ' 

where the repeated index (3 is summed over, and where momentum averages are defined relative to the equilib- 
rium distribution, i.e. 

(4>(p)) = [d 3 pcf>(p)f a (p) / [d 3 pf (p) = [d 3 vP(v)<f>(mv) . (8.97) 
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In this context, it is useful to point out the identity 

d 3 pf°{p)=nd 3 vP(v) , (8.98) 

where 

P(v) = (^t) 3/2 e~^~ v ^ T (8.99) 

is the Maxwell velocity distribution. 

Note that if = 0(e) is a function of the energy, and if V = 0, then 

d 3 p f°(p) = n d 3 v P(v) = n P(e) de , (8.100) 

where 

P(e) = ^=(k B T)~ 3 / 2 a 1 ' 2 e~^ k * T , (8.101) 

oo 

is the Maxwellian distribution of single particle energies. This distribution is normalized with J de P(e) = 1. 

o 

Averages with respect to this distribution are given by 

oo oo 

(0(e)) = Jde4>(e)P(e) = -^(k B T)- 3/2 Jde e 1 ' 2 0(e) e~^ T . (8.102) 
o o 

If 0(e) is homogeneous, then for any a we have 

(e«) = ^r(a+|)(fc B rr • (8.103) 

Due to spatial isotropy, it is clear that we can replace 

V " v ^±v 2 6 a0 = ^-6 a0 (8.104) 
in eqn. 8.96. We then have j E = —k VT, with 

■ 2( e-c p T)) = h -^ = ^vc p , (8.105) 



3mfc B T 2 x v p ' ' 2m 
where we have used c p = |fc B and v 2 = 8 ^ ra T . The quantity k is called the thermal conductivity. Note that k oc T 1 / 2 . 

8.5.4 Viscosity 

Consider the situation depicted in fig. 8.4. A fluid filling the space between two large flat plates at z = and 
z = d is set in motion by a force F = Fx applied to the upper plate; the lower plate is fixed. It is assumed that the 
fluid's velocity locally matches that of the plates. Fluid particles at the top have an average x-component of their 
momentum (p x ) = mV. As these particles move downward toward lower z values, they bring their x-momenta 
with them. Therefore there is a downward (— ^-directed) flow of (p x ). Since a; -momentum is constantly being 
drawn away from z = d plane, this means that there is a — cc-directed viscous drag on the upper plate. The viscous 
drag force per unit area is given by f<j rag /^4 = —r/V/d, where V/d = dV x /dz is the velocity gradient and ?y is the 
shear viscosity. In steady state, the applied force balances the drag force, i.e. F + F drag = 0. Clearly in the steady 
state the net momentum density of the fluid does not change, and is given by jpVx, where p is the fluid mass 
density. The momentum per unit time injected into the fluid by the upper plate at z — d is then extracted by the 
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z = d 

v x = v 



z 



Figure 8.4: Gedankenexperiment to measure shear viscosity 77 in a fluid. The lower plate is fixed. The viscous drag 
force per unit area on the upper plate is F^ing/A = — pV/ d. This must be balanced by an applied force F. 




lower plate at z = 0. The momentum flux density U xz = n{p x v z ) is the drag force on the upper surface per unit 
area: H xz = —p ^Ss The units of viscosity are [p] = M/LT. 

We now provide some formal definitions of viscosity. As we shall see presently, there is in fact a second type of 
viscosity called second viscosity or bulk viscosity, which is measurable although not by the type of experiment 
depicted in fig. 8.4. 

The momentum flux tensor TL aj3 = n(p a Vg) is defined to be the current of momentum component p a in the 
direction of increasing x^. For a gas in motion with average velocity V, we have 

n a p=nm((V a + v' a )(Vp + v'p)) 

= nm V a V g + nm ( v' a v' p ) ,„.,„,, 
2 (8.106) 
= nmV a V + \nm{v ) S ap 

where v' is the particle velocity in a frame moving with velocity V, and where we have invoked the ideal gas law 
p = nk s T. The mass density is p = nm. 

When V is spatially varying, 

n Q/3 = pS a[3 + P V a V p - a af3 , (8.107) 

where a a ^ is the viscosity stress tensor. Any symmetric tensor, such as cr Q/3 , can be decomposed into a sum of 
(i) a traceless component, and (ii) a component proportional to the identity matrix. Since a a g should be, to first 
order, linear in the spatial derivatives of the components of the velocity field V, there is a unique two-parameter 
decomposition: 

( dV dVg - \ 

\ dx l3 dx a J (8.108) 

= 27? (Q a p ~ \ Tr (Q) 5 J) + C Tr (Q) 5 af} . 



The coefficient of the traceless component is p, known as the shear viscosity. The coefficient of the component 
proportional to the identity is Q, known as the bulk viscosity. The full stress tensor a a/3 contains a contribution from 
the pressure: 

° a {3 = ~P S al3 + V a! 3 ■ (8.109) 

The differential force dF a that a fluid exerts on on a surface element n dA is 

dF a = -a a/3 n fj dA , (8.110) 
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T 1 I I I 1 




rEMP x 10"*, *K 



Figure 8.5: Left: thermal conductivity (A in figure) of Ar between T = 800 K and T = 2600 K. The best fit to a 
single power law A = aT b results in b = 0.651. Source: G. S. Springer and E. W. Wingeier, /. Chem Phys. 59, 1747 
(1972). Right: log-log plot of shear viscosity (/i in figure) of He between T k 15K and T m 1000 K. The red line 
has slope \. The slope of the data is approximately 0.633. Source: J. Kestin and W. Leidenfrost, Physica 25, 537 
(1959). 



where we are using the Einstein summation convention and summing over the repeated index (3. We will now 
compute the shear viscosity r) using the Boltzmann equation in the relaxation time approximation. 

Appealing again to eqn. 8.85, with F = and h = c p T, we find 

Sf = ~^{mv a v Q ap + £Z3£ „ . VT - V- f . (8.111) 

We assume VT = V- V = 0, and we compute the momentum flux: 



U xz = n Jd 3 pp x v z Sf 



KT 



Thus, if V x = V x (z), we have 



from which we read off the viscosity, 



Note that r)(T) cx T 1 / 2 . 



(8.112) 



nr ( dV x dV z \ 2 2 
, m (dV g dV a 

U xz ^-nrk B T^ (8.113) 
7] = nk B Tr = ^-nmiv . (8.114) 



How well do these predictions hold up? In fig. 8.5, we plot data for the thermal conductivity of argon and 
the shear viscosity of helium. Both show a clear sublinear behavior as a function of temperature, but the slope 
dlnn/dT is approximately 0.65 and dlnrj/dT is approximately 0.63. Clearly the simple model is not even getting 
the functional dependence on T right, let alone its coefficient. Still, our crude theory is at least qualitatively correct. 
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Why do both k(T) as well as r\ (T) decrease at low temperatures? The reason is that the heat current which flows 
in response to VT as well as the momentum current which flows in response to dV x jdz are due to the presence of 
collisions, which result in momentum and energy transfer between particles. This is true even when total energy 
and momentum are conserved, which they are not in the relaxation time approximation. Intuitively we might 
think that the viscosity should increase as the temperature is lowered, since common experience tells us that fluids 
'gum up' as they get colder - think of honey as an extreme example. But of course honey is nothing like an 
ideal gas, and the physics behind the crystallization or glass transition which occurs in real fluids when they get 
sufficiently cold is completely absent from our approach. In our calculation, viscosity results from collisions, and 
with no collisions there is no momentum transfer and hence no viscosity. If, for example, the gas particles were 
to simply pass through each other, as though they were ghosts, then there would be no opposition to maintaining 
an arbitrary velocity gradient. 



8.5.5 Oscillating external force 

Suppose a uniform oscillating external force F cy± (t) = F e~ lut is applied. For a system of charged particles, this 
force would arise from an external electric field F cxt = qE e~ luJt , where q is the charge of each particle. We'll 
assume VT = 0. The Boltzmann equation is then written 



df , P df 



Fe 



df 



f-f 



dt m dr dp t 

We again write / = f° + Sf, and we assume Sf is spatially constant. Thus, 



dSf „ iut df Sf 
at oe t 



(8.115) 



(8.116) 



If we assume Sf(t) = Sf(u) e luJt then the above differential equation is converted to an algebraic equation, with 
solution 



Sf(t) = -——?fF-v. 
1 — ilot oe 



(8.117) 



We now compute the particle current: 



d d p v Sf 



-iujt ]7 



T e 



1 — ilot k B T. 
t e~ iuJt nF, 



1 — ILOT 3fc B T, 
tit F„ e~ iuJt 



d 3 P f°{p) v a Vp 
d 3 v P(v) v 2 



(8.118) 



If the particles are electrons, with charge q 
then obtain 



m 1 — ILOT 

-e, then the electrical current is (— e) times the particle current. We 



31 



'(*) 



ne t E„ e 



m 



1 — ILOT 



where 



'a/3 



(8.119) 



(8.120) 



m 1 — ILOT 

is the frequency-dependent electrical conductivity tensor. Of course for fermions such as electrons, we should be 
using the Fermi distribution in place of the Maxwell-Boltzmann distribution for f°(p). This affects the relation 
between n and fi only, and the final result for the conductivity tensor (? a g(v) is unchanged. 
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8.5.6 Quick and Dirty Treatment of Transport 

Suppose we have some averaged intensive quantity <f> which is spatially dependent through T(r) or fi(r) or V(r). 
For simplicity we will write 4> = 4>(z). We wish to compute the current of <p across some surface whose equation 
is dz — 0. If the mean free path is £, then the value of <fi for particles crossing this surface in the +z direction is 
4>(z — £ cos 9), where 9 is the angle the particle's velocity makes with respect to z, i.e. cos 9 = v z /v. We perform the 
same analysis for particles moving in the — z direction, for which <f> = 4>(z + £ cos 9). The current of <j> through this 
surface is then 



= nz J d A v P(v) v z 4>(z - £ cos 9) + nz j d A v P(v) v z <f>(z + £ cos 9) 

v z >0 v z <0 \21) 



oz J v 



*nvl — — z 
J oz 



where v = J 8 ^ t T is the average particle speed. If the z-dependence of cf> comes through the dependence of </> on 
the local temperature T, then we have 

j^ = -\ntv^VT=-KVT, (8.122) 

where 

K=\nlv^ (8.123) 

is the transport coefficient. If <fi = (e), then |^ = c p , where c p is the heat capacity per particle at constant pressure. 
We then find j s = —k VT with thermal conductivity 

n=\n£vc p . (8.124) 

Our Boltzmann equation calculation yielded the same result, but with a pref actor of -| instead of | . 

We can make a similar argument for the viscosity. In this case <f> = (p x ) is spatially varying through its dependence 
on the flow velocity V(r). Clearly d(j)/dV x = m, hence 

f Px =n xz = -\nmlv^, (8.125) 

from which we identify the viscosity, r/ = ^nm£v. Once again, this agrees in its functional dependences with the 
Boltzmann equation calculation in the relaxation time approximation. Only the coefficients differ. The ratio of the 
coefficients is K QDC /K BRT = ^ = 0.849 in both cases 6 . 



8.5.7 Thermal diffusivity, kinematic viscosity, and Prandtl number 

Suppose, under conditions of constant pressure, we add heat q per unit volume to an ideal gas. We know from 
thermodynamics that its temperature will then increase by an amount AT = q/nc p . If a heat current j q flows, then 
the continuity equation for energy flow requires 

nc p ^ + V-j q = 0. (8.126) 
6 Here we abbreviate QDC for 'quick and dirty calculation' and BRT for 'Boltzmann equation in the relaxation time approximation' . 
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Pr 


Hp 

1 It: 


1 Q ^ 






U.UOi 


Ar 


22.3 


17.4 


2.50 


0.666 


Xe 


22.7 


5.46 


2.50 


0.659 


H 2 


8.67 


179 


3.47 


0.693 


N 2 


17.6 


25.5 


3.53 


0.721 


o 2 


20.3 


26.0 


3.50 


0.711 


CH 4 


11.2 


33.5 


4.29 


0.74 


CO, 


14.8 


18.1 


4.47 


0.71 


NH 3 


10.1 


24.6 


4.50 


0.90 



Table 8.1: Viscosities, thermal conductivities, and Prandtl numbers for some common gases at T = 293 K and 
p = 1 atm. (Source: Table 1.1 of Smith and Jensen, with data for triatomic gases added.) 



In a system where there is no net particle current, the heat current j q is the same as the energy current j e , and 
since j e — —ft VT, we obtain a diffusion equation for temperature, 

? = -V 2 T. (8.127) 

ot nc p 

The combination 

a = — (8.128) 

nc p 

is known as the thermal diffusivity. Our Boltzmann equation calculation in the relaxation time approximation 
yielded the result k = nk B TTc p /m. Thus, we find a = k^Tr/m via this method. Note that the dimensions of a are 
the same as for any diffusion constant D, namely [a] = L 2 /T. 

Another quantity with dimensions of L 2 /T is the kinematic viscosity, v = rj/ p, where p = nm is the mass density. 
We found 77 = nk B Tr from the relaxation time approximation calculation, hence v — k B TT/m. The ratio v/a, 
called the Prandtl number, Pr = rjc p /mn, is dimensionless. According to our calculations, Pr = 1. According to 
table 8.1, most monatomic gases have Pr r* |. 



8.6 Diffusion and the Lorentz model 
8.6.1 Failure of the relaxation time approximation 

As we remarked above, the relaxation time approximation fails to conserve any of the collisional invariants. It is 
therefore unsuitable for describing hydrodynamic phenomena such as diffusion. To see this, let f(r, v, t) be the 
distribution function, here written in terms of position, velocity, and time rather than position, momentum, and 
time as befor 7 . In the absence of external forces, the Boltzmann equation in the relaxation time approximation is 

df df f - f° 

iL + v -ir = — -■ ( 8 - 129 ) 

at or t 
The density of particles in velocity space is given by 

h(v,t) = Jd\ f(r,v,t) . (8.130) 



7 The difference is trivial, since p = ml). 
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In equilibrium, this is the Maxwell distribution times the total number of particles: h (v) = NP M (v). The number 
of particles as a function of time, N(t) = Jd 3 v n(v, t), should be a constant. 



Integrating the Boltzmann equation one has 



dn n — h v 



Thus, with §n(v, t) = h(v, t) — h (v), we have 

Sn(v,t) = 5h{v,0)e- t/T 



(8.131) 



(8.132) 



Thus, n(v,t) decays exponentially to zero with time constant t, from which it follows that the total particle number 
exponentially relaxes to N . This is physically incorrect; local density perturbations can't just vanish. Rather, they 
diffuse. 



8.6.2 Modified Boltzmann equation and its solution 

To remedy this unphysical aspect, consider the modified Boltzmann equation, 



dt dr 



1 






T 


-/ + 





(8.133) 



where P is a projector onto a space of isotropic functions of v: P F = J |^ F(v) for any function F(v). Note that 
V F is a function of the speed v = \v\. For this modified equation, known as the Lorentz model, one finds d t h = 0. 

The model in eqn. 8.133 is known as the Lorentz model®. To solve it, we consider the Laplace transform, 



f(k, v, s) = dt e~ st d\ e~ ik - r f(r, v, t) 



(8.134) 



Taking the Laplace transform of eqn. 8.133, we find 

(s + iv ■ k + r" 1 ) f(k, v, s) = t" 1 P f(k, v, s) + f(k, v,t = 0) 
We now solve for P /(fc, v, s): 

f(k,v,t = 0) 



which entails 



Now we have 



f(k,v,s 
>f(k,v,s) = 



s + iv ■ k + t 



— P /(*:,«, a) 



dv 



4-7T s + iv ■ k + t 1 



>f(k,v,s] 



s + iv ■ fc + T 1 ' 

rdv f(k,v,t = 0) 



47T s + iv ■ k + T 1 



(8.135) 



(8.136) 



(8.137) 



di) t 

4ir s + iv ■ k + t -1 J s + ivkx + t _1 
-l 



= dx 



= — tan 

vk 



vki 



1 + TS 



s See the excellent discussion in the book by Krapivsky, Redner, and Ben-Nairn, cited in §8.1. 



(8.138) 
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Thus, 



>f(k,v,s) = 



1 ; — tan 

VKT 



vkr 

l + TS 



dv f(k,v,t=0) 
4tt s + iv ■ k + r -1 



(8.139) 



We now have the solution to Lorentz's modified Boltzmann equation: 



f(k,v,s) 



s + iv ■ k + t 1 



1 — tan" 1 

VKT 

f(k,v,t = 0) 
s + iv ■ k + t _1 



vkr 

1 + TS 



dv f(k,v,t=0) 
4tt s + iv ■ k + r -1 



(8.140) 



Let us assume an initial distribution which is perfectly localized in both r and v. 

f(r,v,t = 0) = 6(v-v ). 

For these initial conditions, we find 

rdv f(k,v,t = 0) 1 S(v-v ) 



4tt s + iv ■ k + t 1 s + iv Q ■ k + t -1 AttVq 



We further have that 



1 i / vkr 

— tan" 1 

vkr \ 1 + ts 



1 ^ tan" 1 1 t— | = st + \k 2 v 2 T 2 



(8.141) 



(8.142) 



(8.143) 



and therefore 



f(k,v,s) 



s + iv ■ k + t 1 s + iv Q ■ k + r 1 s + k 2 t 

5{v-v ) 
s + iv Q ■ k + r -1 



(8.144) 



We are interested in the long time limit t ^> r for f(r,v,t). This is dominated by s ~ t 1 , and we assume that r 1 
is dominant over s and iv ■ k. We then have 



f(k,v,s) 



1 S(v-v ) 



S + k 2 T 47TVq 



Attv 2 



Performing the inverse Laplace and Fourier transforms, we obtain 
where the diffusion constant is 

The units are [D] = L 2 /T. Integrating over velocities, we have the density 



D = \v 2 t 



Note that 



for all time. Total particle number is conserved! 



(r, t) = d 3 v f(r, v, t) = {AnDt) 



d\ n(r, t) = 1 



-3/2 p ~r 2 /4Dt 



(8.145) 

(8.146) 
(8.147) 

(8.148) 
(8.149) 
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8.7 Linearized Boltzmann Equation 

8.7.1 Linearizing the collision integral 

We now return to the classical Boltzmann equation and consider a more formal treatment of the collision term in 
the linear approximation. We will assume time-reversal symmetry, in which case 

(§9 u = J d3 PiJ d3 P'l d3 Pi w (P'>P'i IP'Px) {f(p') /(Pi) - /(P) /(Pi)} ■ ( 815 °) 

The collision integral is nonlinear in the distribution /. We linearize by writing 

f(p) = f(p) + f(p)Tp(p), (8.151) 
where we assume ip(p) is small. We then have, to first order in i/j, 

where the action of the linearized collision operator is given by 

U = Jd^JdyJd 3 ^ w{p\p\ | P , Pl ) f( Pl ) (v(p') + Wi) - i>(p) - Hpi)} 
= Jd 3 Pl Jdn \ v - Vl \!^L f°( Pl ) {tP( p ') + vXp'i) - 1>(p) - V(pi)} , 



(8.153) 



where we have invoked eqn. 8.55 to write the RHS in terms of the differential scattering cross section. In deriving 
the above result, we have made use of the detailed balance relation, 

f(p)f Q (p 1 ) = f Q (p , )f°(p' 1 ). (8.154) 

We have also suppressed the r dependence in writing f(p), /°(p), and ip{p)- 

From eqn. 8.85, we then have the linearized equation 

L -Vl^> = Y > (8-155) 



dt 



where, for point particles, 



Y = ^f\ E{P) T CpT v.VT + mv a v Q a ,-^-V.V-F. v y (8.156) 
Eqn. 8.155 is an inhomogeneous linear equation, which can be solved by inverting the operator L — J^-. 

8.7.2 Linear algebraic properties of L 

Although L is an integral operator, it shares many properties with other linear operators with which you are 
familiar, such as matrices and differential operators. We can define an inner product 9 , 

(^|l& 2 > = Jd 3 P f(p)Mp)Mp) ■ (8-157) 



'The requirements of an inner product (f\g) are symmetry, linearity, and non-negative definiteness. 
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Note that this is not the usual Hilbert space inner product from quantum mechanics, since the factor f°(p) is 
included in the metric. This is necessary in order that L be self-adjoint: 

(^i|LV a > = (£lMlM- ( 8 - 158 ) 

We can now define the spectrum of normalized eigenfunctions of L, which we write as 4> n {p). The eigenf unctions 
satisfy the eigenvalue equation, 

U n = -KK, (8-159) 

and may be chosen to be orthonormal, 

5 mn . (8.160) 



I rti , 



Of course, in order to obtain the eigenfunctions <f> n we must have detailed knowledge of the function w(p' , p\ \p,Pi) 

Recall that there are five collisional invariants, which are the particle number, the three components of the total 
particle momentum, and the particle energy. To each collisional invariant, there is an associated eigenfunction <fr n 
with eigenvalue A„ = 0. One can check that these normalized eigenfunctions are 

0„(p) = 4= ( 8i61 ) 

<t> p (p) = -r^== (8.162) 
*.<P) = \fl( e M-?). (8-163) 



3n \k B T 2, 

If there are no temperature, chemical potential, or bulk velocity gradients, and there are no external forces, then 
Y = and the only changes to the distribution are from collisions. The linearized Boltzmann equation becomes 

^ = 14 . (8.164) 
We can therefore write the most general solution in the form 

4,{p,t)=Y^CMp)z~ Kt , (8-165) 

n 

where the prime on the sum reminds us that collisional invariants are to be excluded. All the eigenvalues A n , 
aside from the five zero eigenvalues for the collisional invariants, must be positive. Any negative eigenvalue 
would cause ip(p, t) to increase without bound, and an initial nonequilibrium distribution would not relax to the 
equilibrium f°(p), which we regard as unphysical. Henceforth we will drop the prime on the sum but remember 
that C n = for the five collisional invariants. 

Recall also the particle, energy, and thermal (heat) currents, 

j = J d 3 p v f(p) = J d 3 p f°(p) v ^(p) = (v\tp) 

j e = J d 3 P v e f(p) = J d 3 P f°(p) v e Hp) ={ve\i>) (8.166) 
j q = Jd 3 p v{e-n) f(p) = Jd 3 P f°(p) v(e-h) ^(p) = {v(e-n)\if>) . 



Notej 9 =j e - nj. 
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8.7.3 Steady state solution to the linearized Boltzmann equation 

Under steady state conditions, there is no time dependence, and the linearized Boltzmann equation takes the form 

Ltp = Y. (8.167) 

We may expand ip in the eigenf unctions 4> n and write ip = J^n^n't'n- Applying L and taking the inner product 
with (f>y we have 

C 5 = -^{4>j\Y) . (8.168) 
Thus, the formal solution to the linearized Boltzmann equation is 

i 1 (p)=-Y d ^-(<t>n\Y)4>n(p)- (8-169) 
n n 

This solution is applicable provided | Y ) is orthogonal to the five collisional invariants. 
Thermal conductivity 

For the thermal conductivity, we take VT = d z T x, and 

where X K = (e — c p T) v x . Under the conditions of no particle flow (j = 0), we have j q = —k d x T x. Then we have 

{X r \^) = -k^. (8.171) 

Viscosity 

For the viscosity, we take 



with X r/ = v x v y . We then 



Thus, 



8.7.4 Variational approach 



dy 



IL xy = (mv x v y \i;) = -r,^. (8.173) 



n dV 

X n i> ) = -H—£l. (8.174) 
' m ay 



Following the treatment in chapter 1 of Smith and Jensen, define H = —L. We have that H is a positive semidef- 
inite operator, whose only zero eigenvalues correspond to the collisional invariants. We then have the Schwarz 
inequality, 

(VHH|VM0|£|0)>(0|^) 2 , (8-175) 
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for any two Hilbert space vectors | ip ) and \<f>). Consider now the above calculation of the thermal conductivity. 
We have 

^ = -A^I X « < 8 - 176 > 

and therefore 

Similarly for the viscosity we have 

^ = Srlt x - (8178) 

from which we derive 

[dVJdyy k B T { < f,\H\4>) 

In order to get a good lower bound, we want 4> in each case to have a good overlap with X K . One approach then 
is to take (f> = X K , which guarantees that the overlap will be finite (and not zero due to symmetry for example). 
We illustrate this method with the viscosity calculation. We have 

T) > x x y „ y ' . (8.180) 

K T (v x v y \H\v x v y ) 

Now the linearized collision operator L acts as 

(<£|£|V0 = Jd 3 P9 (p)Hp)Jd 3 P 1 JdQ^\v~v 1 \fy Pl )[^(p)+^( Pl )-^(p')-i:( P / 1 )} . (8.181) 

Here the kinematics of the collision guarantee total energy and momentum conservation, so p' and p\ are deter- 
mined as in eqn. 8.56. 

Now we have 

dQ = sinxdxd(p, (8.182) 

where \ is the scattering angle depicted in Fig. 8.6 and ip is the azimuthal angle of the scattering. The differential 
scattering cross section is obtained by elementary mechanics and is known to be 



da_ 

da ~ 

where b is the impact parameter. The scattering angle is 



d(b 2 /2) 



dsinx 



(8.183) 



X (b, u) = 7T - 2 / dr == , (8.184) 

where m = \m is the reduced mass, and r p is the relative coordinate separation at periapsis, i.e. the distance of 
closest approach, which occurs when r = 0, i.e. 



\mu 2 = 

* 2mr£ 



U(r p ) , (8.185) 



where £ = mub is the relative coordinate angular momentum. 
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b+db 



Figure 8.6: Scattering in the CM frame. O is the force center and P is the point of periapsis. The impact parameter 
is b, and \ is the scattering angle. (f>o is the angle through which the relative coordinate moves between periapsis 
and infinity. 



We work in center-of-mass coordinates, so the velocities are 

v = V + \u 
v 1 = V — \u 



v' = V + fit' 
v[ = V - \v! 



with |m| = \u'\ and u ■ u' = cos \- Then if ijj(p) — v x v y , we have 

A(v>) = V(p) + V>(Pi) - Hp') - iip'i) = W x u y - u' x u' y ) 

We may write 

u' = u ^sin x cos tp e 1 + sin \ sin if e 2 + cos \ e 3 
where e 3 = u. With this parameterization, we have 

27T 



Jdip \{u a u p - u a u p ) = -7rsin 2 x (u 2 S a/3 - 3u a u ) 



Note that we have used here the relation 

e la e 10 + e 2a e 2/3 + e 3a e 3/3 = $af3 ; 

which holds since the LHS is a projector Xa=i l^i) (^sl- 
It is convenient to define the following integral: 



R(u) = Jdbb sm 2 x(b,u) 




Since the Jacobian 



det 



(dv, dv-^) 
(dV, du) 



= 1, 



(8.186) 
(8.187) 



(8.188) 
(8.189) 

(8.190) 

(8.191) 



(8.192) 



(8.193) 
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we have 

( v x v y | L | v x v y > = n 2 Jd 3 V f d\ e~^l^ e ^/^T . u . . R{u) . ^ 

This yields 



oo oo 

(F(u)) = fduu 2 e~ mu2 / 4k * T F{u)l Jduu 2 e~ mu2 l Ak * 



It is easy to compute the term in the numerator of eqn. 8.180: 

\ 3 / 2 r „ / u tA 2 



<v x v y \v x v y ) = n 



2nk„T 



Putting it all together, we find 



The computation for k is a bit more tedious. One has ip(p) = (s — c p T) v x , in which case 

A&) = ±m[(V-u)u x -(V- U ')u' x . 
Ultimately, one obtains the lower bound 



(8.194) 



( v x v y | L | v x v y ) = ^n 2 (u 5 R(u)) , (8.195) 

where 



(8.196) 



' Jd\e- mv2 / 2k * T vlvl=n(^pj . (8.197) 



^ 40(fc B r) 3 / B ^ (g i98) 



(8.199) 



k > 150fc B( fc B T ) 3 / / u 5 R{u) x (8 200) 

Trm 3 / 

Thus, independent of the potential, this variational calculation yields a Prandtl number of 

Pr = H = H£p = a (8 .201) 

a m k J 

which is very close to what is observed in dilute monatomic gases (see Tab. 8.1). 

While the variational expressions for r) and n are complicated functions of the potential, for hard sphere scattering 
the calculation is simple, because b = dsin </> = dcos(^x)' where d is the hard sphere diameter. Thus, the impact 
parameter b is independent of the relative speed u, and one finds R(u) = ^d 3 . Then 



(u 5 R(u)) = U 3 (u 5 ) = H£ ( ^] /2 d 2 



and one finds 

5 



r,> 5 t k flT , ->Jfe^f. (8-203) 
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8.8 The Equations of Hydrodynamics 

We now derive the equations governing fluid flow. The equations of mass and momentum balance are 

| + V.( P y)= (8.204) 

d( P v a ) , 9n aP 

—ar + = ■ (8 - 205) 

where 



^ a p = pV a V p + P 5 aP - U^ + ^-IV-VS a ^+tV-V5 a A . (8.206) 

Substituting the continuity equation into the momentum balance equation, one arrives at 

BV 

p ~bJ + p {v ' v)v = ~ Vp + v v + (c + i^ v ( v ' y ) . ( 8 - 207 ) 

which, together with continuity, are known as the Navier-Stokes equations. These equations are supplemented by 
an equation describing the conservation of energy, 

T ^ +TV ' (sV)= ^S + V ' (kVT) - (8 - 208) 

Note that the LHS of eqn. 8.207 is p D V /Dt, where D/Dt is the convective derivative. Multiplying by a differential 
volume, this gives the mass times the acceleration of a differential local fluid element. The RHS, multiplied by 
the same differential volume, gives the differential force on this fluid element in a frame instantaneously moving 
with constant velocity V. Thus, this is Newton's Second Law for the fluid. 



8.9 Nonequilibrium Quantum Transport 
8.9.1 Boltzmann equation for quantum systems 

Almost everything we have derived thus far can be applied, mutatis mutandis, to quantum systems. The main 
difference is that the distribution f° corresponding to local equilibrium is no longer of the Maxwell-Boltzmann 
form, but rather of the Bose-Einstein or Fermi-Dirac form, 

where the top sign applies to bosons and the bottom sign to fermions. Here we shift to the more common notation 
for quantum systems in which we write the distribution in terms of the wavevector k = p/H rather than the 
momentum p. The quantum distributions satisfy detailed balance with respect to the quantum collision integral 




(8.210) 



where w = w(k,k 1 | k',k[), f = f(k), f 1 = f{k 1 ), f = f(k'), and f[ = f(k' 1 ), and where we have assumed 
time-reversal and parity symmetry. Detailed balance requires 
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where / = f is the equilibrium distribution. One can check that 

i f 

f = =>• — — = e ^~ e > (8 212) 

7 e^(=-M) T i ^ 1±/ ' ^ ' 

which is the Boltzmann distribution, which we have already shown to satisfy detailed balance. For the streaming 
term, we have 



de V K T 



de \ k B T k B T 2 k B T 

9f° fgg dr | e-^dT ^ de 
de 1 dr T dr dk 



from which we read off 



df 


df f 


dr 


<9e \ 


df 


„ 5/° 


dk 


£7£ 



9/x e — \i dT 
dr T dr 



(8.214) 



The most important application is to the theory of electron transport in metals and semiconductors, in which case 
f is the Fermi distribution. In this case, the quantum collision integral also receives a contribution from one-body 
scattering in the presence of an external potential U (r), which is given by Fermi's Golden Rule: 



k'e n 
2tt /" d 3 fc 



fiV 7 (2?r) 



J7(fe-fc')| 2 (/(fc')-/(fc))^(^)-£(fc')) 



(8.215) 



<> 



The wavevectors are now restricted to the first Brillouin zone, and the dispersion e(k) is no longer the ballistic 
form e = h 2 k 2 /2m but rather the dispersion for electrons in a particular energy band (typically the valence band) 
of a solid 10 . Note that / = f satisfies detailed balance with respect to one-body collisions as well 11 . 

In the presence of a weak electric field E and a (not necessarily weak) magnetic field B, we have, within the 
relaxation time approximation, / = f + 5f with 



cMf _ _e_ r dSf 
dt he dk 



T 



df° Sf 

^- = -- , (8.216) 

de t 



where £ = — V(0 — /u/e) = E — e 1 V/j is the gradient of the 'electrochemical potential' <f> — e 1 fx. In deriving the 

dSf 
dr 

dk 



above equation, we have worked to lowest order in small quantities. This entails dropping terms like v-^- (higher 
order in spatial derivatives) and E ■ (both E and Sf are assumed small). Typically r is energy-dependent, i.e. 



re 



10 We neglect interband scattering here, which can be important in practical applications, but which is beyond the scope of these notes. 
11 The transition rate from [fe') to |fe) is proportional to the matrix element and to the product /'(l — /). The reverse process is proportional 
to /(l — /'). Subtracting these factors, one obtains /' — /, and therefore the nonlinear terms felicitously cancel in eqn. 8.215. 
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We can use eqn. 8.216 to compute the electrical current j and the thermal current j , 

d 3 k 

W) 



2e l7KZ^ vS f ( 8 - 217 ) 



/d 3 k 
— (s-„)v5f. (8.218) 

n 

Here the factor of 2 is from spin degeneracy of the electrons (we neglect Zeeman splitting). 

In the presence of a time-independent temperature gradient and electric field, linearized Boltzmann equation in 
the relaxation time approximation has the solution 

Sf = -r(e) v ■ (eS+ ^Vt) . (8.219) 

We now consider both the electrical current 12 j as well as the thermal current density j q . One readily obtains 

f d 3 k 

j = -2ej—v5f = L 11 £-L 12 VT (8.220) 



n 



d 3 k 



U = 2 17^3 (e-p)vdf = L 21 £-L 22 VT (8.221) 



where the transport coefficients L 11 etc. are matrices: 

< - Tig - - jjj (. - ,) (-§£) /«, ^ (8.223) 
If we define the hierarchy of integral expressions 

& s si ( £ - "> w (-^) M 9f (8 - 225) 



then we may write 



Lt = e 2 jf , L a £ = TLt = -e3f , L$ = ±jf. (8.226) 



(8.227) 



The linear relations in eqn. (8.221) may be recast in the following form: 

S= P j + QVT 

j v = nj - K VT , 

where the matrices p, Q, n, and k are given by 

p = L~ x l Q = L\l L 12 (8.228) 

n = L 21 Li* ft = L 22 - L 21 Lfj 1 L 12 , (8.229) 



12 In this section we use j to denote electrical current, rather than particle number current as before. 
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T=Tj 




V = (Qe-QJCT.-Tj 

Figure 8.7: A thermocouple is a junction formed of two dissimilar metals. With no electrical current passing, an 
electric field is generated in the presence of a temperature gradient, resulting in a voltage V = Va — Vb ■ 



or, in terms of the J n , 



n 



J* 1 

- J\ Jo 



eT 



= ji[J2-J 1 J J X 



(8.230) 
(8.231) 



These equations describe a wealth of transport phenomena: 



• Electrical resistance (VT = B = 0) 

An electrical current j will generate an electric field £ = pj, where p is the electrical resistivity. 

• Peltier effect (VT = B = 0) 

An electrical current j will generate an heat current j q = nj, where n is the Peltier coefficient. 

• Thermal conduction (j = B = 0) 

A temperature gradient VT gives rise to a heat current j q = -kVT, where k is the thermal conductivity. 

• Seebeck effect (j = B = 0) 

A temperature gradient VT gives rise to an electric field £ = Q VT, where Q is the Seebeck coefficient. 



One practical way to measure the thermopower is to form a junction between two dissimilar metals, A and B. The 
junction is held at temperature T\ and the other ends of the metals are held at temperature T . One then measures 
a voltage difference between the free ends of the metals - this is known as the Seebeck effect. Integrating the 
electric field from the free end of A to the free end of B gives 

B 

V A -V B = - Js-dl = (Q B - Qk){T\ - T ) . (8.232) 

A 

What one measures here is really the difference in thermopowers of the two metals. For an absolute measurement 
of Qa, replace B by a superconductor (Q = for a superconductor). A device which converts a temperature 
gradient into an emf is known as a thermocouple. 
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A 


B 






i 





i q = (n B nj-i 

Figure 8.8: A sketch of a Peltier effect refrigerator. An electrical current / is passed through a junction between 
two dissimilar metals. If the dotted line represents the boundary of a thermally well-insulated body then the body 
cools when [~1b > flA, in order to maintain a heat current balance at the junction. 



The Peltier effect has practical applications in refrigeration technology. Suppose an electrical current I is passed 
through a junction between two dissimilar metals, A and B. Due to the difference in Peltier coefficients, there will 
be a net heat current into the junction of W = (FIa — rie) I- Note that this is proportional to I, rather than the 
familiar I 2 result from Joule heating. The sign of W depends on the direction of the current. If a second junction 
is added, to make an ABA configuration, then heat absorbed at the first junction will be liberated at the second. 13 



, V • j = (8.233) 



8.9.2 The Heat Equation 

We begin with the continuity equations for charge density p and energy density e: 

dp 

^- + V-j e =j-E, (8.234) 

where E is the electric field 14 . Now we invoke local thermodynamic equilibrium and write 

de de dn de dT 
dt = dn ~di + df ~di 



Hdp dT 

- e m +c ^> (8 - 235) 



13 To create a refrigerator, stick the cold junction inside a thermally insulated box and the hot junction outside the box. 
14 Note that it is E ■ j and not £■ j which is the source term in the energy continuity equation. 
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where n is the electron number density (n = —p/e) and c v is the specific heat. We may now write 

dT de p dp 



dt dt e dt 

= j-E-V-j £ -^V-j 

= jS-Vj q . (8.236) 

Invoking j = nj — kV T, we see that if there is no electrical current (j = 0), we obtain the heat equation 

dT 8*1 

= d^d^ ■ (8 - 237) 

This results in a time scale r T for temperature diffusion r T = CL 2 c v /k, where L is a typical length scale and C is 
a numerical constant. For a cube of size L subjected to a sudden external temperature change, L is the side length 
and C = 1/37T 2 (solve by separation of variables). 



8.9.3 Calculation of Transport Coefficients 



We will henceforth assume that sufficient crystalline symmetry exists (e.g. cubic symmetry) to render all the trans- 
port coefficients multiples of the identity matrix. Under such conditions, we may write J" & = J n 5 a p with 



= ^ few- few 

The low-temperature behavior is extracted using the Sommerfeld expansion 

1= jdsH{e) {^-^l^j =TrT>csc{iTV)H(e) 



H(p) + -(k B T) 2 H"(p) 



(8.238) 

(8.239) 
(8.240) 



where V = k B T is a dimensionless differential operator. 



15 



Let us now perform some explicit calculations in the case of a parabolic band with an energy-independent scat- 
tering time r. In this case, one readily finds 



- 3 / 2 irV cscirVs 3 / 2 (s - p) n 



(8.241) 



where a Q = ne 2 T/m*. Thus, 



Jo = 
Ji = 

J2 = 



8 p 2 



Vo* 2 (k B T) 2 
e 2 2 p 

2 

6 



(8.242) 



15 Remember that physically the fixed quantities are temperature and total carrier number density (or charge density in the case of electron 
and hole bands), and not temperature and chemical potential. An equation of state relating n, fi, and T is then inverted to obtain [±(n, T), so 
that all results ultimately may be expressed in terms of n and T. 
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from which we obtain the low-T results p = a Q 1 , 



Q = ~— «=~&, (8.243) 

2 ee F 3 m* 

and of course n = TQ. The predicted universal ratio 

^ = y (fc B /e) 2 = 2.45 x 10" 8 V 2 K~ 2 , (8.244) 

is known as the Wiedemann-Franz law. Note also that our result for the thermopower is unambiguously nega- 
tive. In actuality, several nearly free electron metals have positive low-temperature thermopowers (Cs and Li, for 
example). What went wrong? We have neglected electron-phonon scattering! 

8.9.4 Onsager Relations 

Transport phenomena are described in general by a set of linear relations, 

Ji = Uk F k , (8.245) 

where the {Fk} are generalized forces and the { JJ are generalized currents. Moreover, to each force Fi corresponds a 
unique conjugate current Ji, such that the rate of internal entropy production is 

dS_ 



s = Y,*iJi => F * = 7rr- ( 8 - 246 ) 



The Onsager relations (also known as Onsager reciprocity) state that 

L ik {B) = mm L ki {-B) , (8.247) 

where r)i describes the parity of Ji under time reversal: 

Jj =mJi, (8-248) 

where J J is the time reverse of J { . To justify the Onsager relations requires a microscopic description of our 
nonequilibrium system. 

The Onsager relations have some remarkable consequences. For example, they require, for B = 0, that the thermal 
conductivity tensor of any crystal must be symmetric, independent of the crystal structure. In general,this 
result does not follow from considerations of crystalline symmetry. It also requires that for every 'off-diagonal' 
transport phenomenon, e.g. the Seebeck effect, there exists a distinct corresponding phenomenon, e.g. the Peltier 
effect. 

For the transport coefficients studied, Onsager reciprocity means that in the presence of an external magnetic field, 

P a p(B) = P Pa (-B) (8.249) 

M B ) = K p a (-B) (8.250) 

n ap (B)=TQ 0a (-B). (8.251) 

Let's consider an isotropic system in a weak magnetic field, and expand the transport coefficients to first order in 
B: 

Pafi(B) = pS ap + ve afi7 Bn (8.252) 
K afl (B) = K6 af) +me afh &' (8.253) 
Q Q/3 ( J B) = g ( 5 Q/3 + C^ 7 S 7 (8.254) 

n aP (B) = ri5 afi + 0e afh Bry . (8.255) 
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Onsager reciprocity requires n = T Q and 8 = T (. We can now write 

£= Pi +vj x B + QVT + CVT x B (8.256) 
j q = r\j + 6j X B - kVT -mVT x B . (8.257) 

There are several new phenomena lurking: 

. Hall effect (f = f =j v =0) 

An electrical current j = j x x and a field B = B z z yield an electric field £. The Hall coefficient is i?H = 

£ y /3x B z = -v- 

• Ettingshausen effect = j y = j q y = 0) 

An electrical current j = j x x and a field B = B z z yield a temperature gradient ^ . The Ettingshausen 
coefficient is P = §E B 2 = -0/k. 

• Nernst effect tf a = j„ = = 0) 

A temperature gradient VT = and a field B = B z z yield an electric field £. The Nernst coefficient is 

A = £„/gB, = -C. 

• Righi-Leduc effect (j x = j = £ y = 0) 

A temperature gradient VT = £i and a field B = B z z yield an orthogonal temperature gradient ^ . 
The Righi-Leduc coefficient is C = /||S 2 = C/Q. 

8.10 Stochastic Processes 

A stochastic process is one which is partially random, i.e. it is not wholly deterministic. Typically the randomness is 
due to phenomena at the microscale, such as the effect of fluid molecules on a small particle, such as a piece of dust 
in the air. The resulting motion (called Brownian motion in the case of particles moving in a fluid) can be described 
only in a statistical sense. That is, the full motion of the system is afunctional of one or more independent random 
variables. The motion is then described by its averages with respect to the various random distributions. 



8.10.1 Langevin equation and Brownian motion 

Consider a particle of mass M subjected to dissipative and random forcing. We'll examine this system in one 
dimension to gain an understanding of the essential physics. We write 

p + IP = F + n(t) . (8.258) 

Here, 7 is the damping rate due to friction, F is a constant external force, and rj(t) is a stochastic random force. 
This equation, known as the Langevin equation, describes a ballistic particle being buffeted by random forcing 
events. Think of a particle of dust as it moves in the atmosphere; F would then represent the external force due 
to gravity and r)(t) the random forcing due to interaction with the air molecules. For a sphere of radius a moving 
with velocity v in a fluid, the Stokes drag is given by -F^rag = —Qnriav, where a is the radius. Thus, 

where M is the mass of the particle. It is illustrative to compute 7 in some setting. Consider a micron sized 
droplet (a = 10~ 4 cm) of some liquid of density p ~ 1.0 g/cm 3 moving in air at T = 20° C. The viscosity of air is 
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rj = 1.8 x 10~ 4 g/cm • s at this temperature 16 . If the droplet density is constant, then 7 = 9ri/2pa 2 = 8.1 x 10 4 s -1 , 
hence the time scale for viscous relaxation of the particle is r = j^ 1 = 12 jus. We should stress that the viscous 
damping on the particle is of course due to the fluid molecules, in some average 'coarse-grained' sense. The 
random component to the force rj(t) would then represent the fluctuations with respect to this average. 

We can easily integrate this equation: 

j t (pe^) =Fe^ + 77(t)e 7 * 

* (8.260) 

p{t) = p(0) e" 7 ' + - (1 - e~ 7 *) + Ids r](s) e^ s ~ t] 



Note that pit) is indeed a functional of the random function rj(t) . We can therefore only compute averages in order 
to describe the motion of the system. 

The first average we will compute is that of p itself. In so doing, we assume that rj(t) has zero mean: (77 (i)) = 0. 
Then 

(p(t)) = p(0) e-* + - (1 - e~^) . (8.261) 

On the time scale 7 _1 , the initial conditions p(0) are effectively forgotten, and asymptotically for t » 7 _1 we have 
(p(t)) — ► F/7, which is the terminal momentum. 

Next, consider 

t t 

( P 2 (t)) = (p(t)f + Jd Sl Jds 2 erf'i-*) e^-V ( V ( Sl )r,( S2 )) . (8.262) 


We now need to know the two-time correlator (^(s-J t](s 2 ))- We assume that the correlator is a function only of 
the time difference As = s x — s 2 , so that the random force r)(s) satisfies 

( V (s)) = (8.263) 
(V(s 1 )v(s 2 )) = ^i-s 2 ). (8.264) 

The function cf>(s) is the autocorrelation function of the random force. A macroscopic object moving in a fluid is 
constantly buffeted by fluid particles over its entire perimeter. These different fluid particles are almost completely 
uncorrelated, hence <f>(s) is basically nonzero except on a very small time scale t±, which is the time a single fluid 
particle spends interacting with the object. We can take Tj, — > and approximate 

<j>(s) ss r S(s) . (8.265) 

We shall determine the value of r from equilibrium thermodynamic considerations below. 
With this form for <fi(s), we can easily calculate the equal time momentum autocorrelation: 



(p\t)) = (p{t)) 2 + r dse^-V 







(8.266) 



= <^)> 2 + £ (1 - e- 2 ^) 



16 The cgs unit of viscosity is the Poise (P). 1 P = 1 g/cm-s. 
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Consider the case where F = and the limit t > 7 1 . We demand that the object thermalize at temperature T. 
Thus, we impose the condition 

2 S) = \KT => r = 2jMk B T , (8.267) 

where M is the particle's mass. This determines the value of r. 
We can now compute the general momentum autocorrelator: 

t t' 

(p(t)p(t')) - (p(t))(p(t')) = fds jds 1 e^'-^ ( V (s) r,(s')) 





= Aftje" 71 ^'' 1 (t,t' -► 00 , |t - t'| finite) 



(8.268) 



The full expressions for this and subsequent expressions, including subleading terms, are contained in an ap- 
pendix, §8.14. 

Let's now compute the position x(t). We find 

t s 

x(t) = (x(t)) + JL Jds Jd Sl v ( Sl )e^-^ , (8.269) 


where 

Ft 1 / P \ 

<-(*)) = *(°) + + - («(0) - W ) (1 - e-r*) • (8.270) 

Note that for jt <C 1 we have (x(t)) = x(0) + v(0) t + \M~ l Ft 2 + 0(t 3 ), as is appropriate for ballistic particles 
moving under the influence of a constant force. This long time limit of course agrees with our earlier evaluation 
for the terminal velocity, = <^p(oo)) /M = F/jM. We next compute the position autocorrelation: 

t t' s s 

(x(t)x(t>)) - (x(t))(x(t')) = -L f ds J ds ' e -7C+»') Jds.jds'.e^+^inis,)^)) 



m\n(t,t')+0(l) . 



M 2 

b b 

2KT 



7M 

In particular, the equal time autocorrelator is 

( x 2 (t))-{ x (t)) 2 = 2 -^±^2Dt, (8.271) 
at long times, up to terms of order unity. Here, 

k T 

D = -S— (8.272) 

7M 

is the diffusion constant. For a liquid droplet of radius a = 1 /im moving in air at T = 293 K, for which 77 = 
1.8 x 1CT 4 P, we have 

D= K L= (1.38 x 10~ 16 crg/K) (293 K) 

67T? ? a 6tt(1.8 x 10" 4 P) (10- 4 cm) 1 v y 

This result presumes that the droplet is large enough compared to the intermolecular distance in the fluid that one 
can adopt a continuum approach and use the Navier-Stokes equations, and then assuming a laminar flow. 
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If we consider molecular diffusion, the situation is quite a bit different. As we shall derive below in §8.10.3, the 
molecular diffusion constant is D = £ 2 /2t, where I is the mean free path and r is the collision time. As we found 
in eqn. 8.91, the mean free path £, collision time r, number density n, and total scattering cross section a are related 
by 

£ = vr = - J— , (8.274) 
V 2 no 

where v = \J 8fc B T/ 7rm is the average particle speed. Approximating the particles as hard spheres, we have 
a = Ana 2 , where a is the hard sphere radius. At T = 293 K, and p = 1 atm, we have n = p/k B T = 2.51 x 
10 19 cm -3 . Since air is predominantly composed of N 2 molecules, we take a = 1.90 x 10~ 8 cm and m = 28.0 amu = 
4.65 x 10 -23 g, which are appropriate for N 2 . We find an average speed of v = 471 m/s and a mean free path of 
£ = 6.21 x 10~ 6 cm. Thus, D = j£v = 0.146 cm 2 /s. Though much larger than the diffusion constant for large 
droplets, this is still too small to explain common experiences. Suppose we set the characteristic distance scale 
at d = 10 cm and we ask how much time a point source would take to diffuse out to this radius. The answer is 
At = d 2 /2D = 343 s, which is between five and six minutes. Yet if someone in the next seat emits a foul odor, 
your sense the offending emission in on the order of a second. What this tells us is that diffusion isn't the only 
transport process involved in these and like phenomena. More important are convection currents which distribute 
the scent much more rapidly. 

8.10.2 Langevin equation for a particle in a harmonic well 

Consider next the equation 

MX + jMX + Mw 2 X = F + rj(t) , (8.275) 
where F is a constant force. We write X = + x and measure x relative to the potential minimum, yielding 

x + 7± + u>%x=— n(t). (8.276) 
At this point there are several ways to proceed. 

Perhaps the most straightforward is by use of the Laplace transform. Recall: 

oo 

x{v) = fdte- vt ri(v) (8.277) 



o 

dp 



*(*) = J^j e+Vti (r)> ( 8 - 278 ) 

c 

where the contour C proceeds from a — ioo to a + too such that all poles of the integrand lie to the left of C. We 
then have 

oo oo 

jj Jdte-» t n(t) = jj Jdte- Vt (x + 1 x + u J 2 x) 



= -{y + 7) a;(0) - i(0) + {y 2 + ~fis + ul) x{u) . (8.279) 



Thus, we have 



{v + 7) x(0) + x(0) 1 1 

xiv) = = 5 h — • -r, , I at e r](t) . (8.280) 

w v 2 + + ujI M V 2 + + LO 2 
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Now we may write 

v 2 +jis + lu 2 = {v-v + ){v-v_) , (8.281) 

where 



v± = -h ± \/h 2 -^ 2 o- ( 8 - 282 ) 

Note that Re (v±) < and that 7 + v± = -v T . 
Performing the inverse Laplace transform, we obtain 



z/_|_ — V / — v_ \ J 

00 

+ ldsK(t- s)rj{s) , 



(8.283) 







where 

K (t _ s ) = M Q ^~_ 3) v ) _ e "-(*- s )) (8.284) 

is the response kernel and 0(£ — s) is the step function which is unity for t > s and zero otherwise. The response is 
causal, i.e. x(t) depends on r)(s) for all previous times s < t, but not for future times s > t. Note that A'(r) decays 
exponentially for r — ► 00, if Re(z/ ± ) < 0. The marginal case where uj a = and v + = corresponds to the diffusion 
calculation we performed in the previous section. 

8.10.3 Discrete random walk 

Consider an object moving on a one-dimensional lattice in such a way that every time step it moves either one 
unit to the right or left, at random. If the lattice spacing is t, then after n time steps the position will be 

n 



where 

{ + 1 if motion is one unit to right at time step j 
— 1 if motion is one unit to left at time step j . 

Clearly {g-) = 0, so (x n ) = 0. Now let us compute 



(8.286) 



j=ij'=i 

where we invoke 

(W) = 8 Jjt . (8.288) 
If the length of each time step is r, then we have, with t = jit, 

e 2 

(x 2 (t)) = —t, (8.289) 



T 



and we identify the diffusion constant 



D = — . (8.290) 
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Suppose, however, the random walk is biased, so that the probability for each independent step is given by 

P(a)= P 6 aA +q6 a i, (8.291) 



where p + q = 1. Then 
and 

Then 



cr,) =p-<? = 2p-l (8.292) 

(8.293) 



(a J v j ,) = (p-qy{l-6 jj ,)+5 jJ , 
= (2p-l) 2 + 4p(l-p)S jf 



{x n ) = {2p-l)ln (8.294) 
{xl)-{x n f =4p(l-p)£ 2 n. (8.295) 



8.10.4 Fokker-Planck equation 



Suppose x(t) is a stochastic variable. We define the quantity 

Sx(t) = x(t + St) - x(t) , (8.296) 

and we assume 

(Sx(t)) =Fi(x(t)) St (8.297) 

([Sx(t)] 2 ) =F 2 (x(t))St (8.298) 

but ( [5x(f )] ™) = 0((St) 2 ) for n > 2. The n = 1 term is due to drift and the n = 2 term is due to diffusion. Now 
consider the conditional probability density, P(x, t \ x , t Q ), defined to be the probability distribution for x = x(t) 
given that x(t ) = x . The conditional probability density satisfies the composition rule, 

oo 

P(x 2 ,t 2 \ x o,~to) = J d x 1 P{x 2 ,t 2 \x 1 ,t 1 )P(x 1 ,t 1 \x ,t ) , (8.299) 

— oo 

for any value of t x . This is also known as the Chapman-Kolmogorov equation. In words, what it says is that the 
probability density for a particle being at x 2 at time t 2 , given that it was at x Q at time t Q , is given by the product of 
the probability density for being at x 2 at time t 2 given that it was at x Y at t 1 , multiplied by that for being at x x at t x 
given it was at x Q at t Q , integrated over x x . This should be intuitively obvious, since if we pick any time t 1 £ [t , t 2 ], 
then the particle had to be somewhere at that time. Indeed, one wonders how Chapman and Kolmogorov got their 
names attached to a result that is so obvious. At any rate, a picture is worth a thousand words: see fig. 8.9. 

Proceeding, we may write 

oo 

P(x,t + St \x ,t ) = dx' P(x,t + St\x',t)P(x',t\x ,t ) . (8.300) 



Now 

P(x, t + St\ x', t) = (S(x - 8x(t) - x')) 

1 + (Sx(t)) + %([5x(t)] 2 )-^ + ...\S(x- x') (8.301) 



dx' 1 dx' 



= S(x x') + F 1 (x') dS{x - X>) St + \F 2 {x') ^ x /) st + 0({Stf) 
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t 








Figure 8.9: Interpretive sketch of the mathematics behind the Chapman-Kolmogorov equation. 



where the average is over the random variables. We now insert this result into eqn. 8.300, integrate by parts, 
divide by St, and then take the limit St — > 0. The result is the Fokker-Planck equation, 



r)P f) 1 8 2 

.[ Fl (x)P(x,t)]+- — [F 2 (x)P(x,t)} 



dt dx 



(8.302) 



8.10.5 Brownian motion redux 



Let's apply our Fokker-Planck equation to a description of Brownian motion. From our earlier results, we have 



F 1 (x) = ^ , F 2 (x)=2D. 
A formal proof of these results is left as an exercise for the reader. The Fokker-Planck equation is then 



dP _ OP d 2 P 
dt dx dx 2 ' 



(8.303) 



(8.304) 

where u = F/jM is the average terminal velocity. If we make a Galilean transformation and define 

y = x-ut , s = t (8.305) 
then our Fokker-Planck equation takes the form 

dP d 2 P 

-&= D -*2- ( 8 - 306 ) 

os oy z 

This is known as the diffusion equation. Eqn. 8.304 is also a diffusion equation, rendered in a moving frame. 
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While the Galilean transformation is illuminating, we can easily solve eqn. 8.304 without it. Let's take a look at 
this equation after Fourier transforming from x to q: 

oo 

P(x,t)= f^-e iq *P(q,t) (8.307) 



(8.308) 



Then as should be well known to you by now, we can replace the operator ^ with multiplication by iq, resulting 
in 

^ P(q, t) = -(Dq 2 + iqu) P(q, t) , (8.309) 

with solution 

P(q, t) = e- Dq2t e~ iqut P{q, 0) . (8.310) 
We now apply the inverse transform to get back to x-space: 

oo oo 

p( Xj 1) = J ^ e lqx e~ DqH e~ lqut j dx' e - iqx ' P{x' , 0) 

— OO 
OO 

— oo — oo 
oo 

= [dx' K(x-x',t)P(x',0) 



— e~ Dq2t e Mx-ut-x') (8.311) 



where 

K U t ) = —L= e -{*-vt?/±Dt ( g 312) 

is the diffusion kernel. We now have a recipe for obtaining P(x, t) given the initial conditions P(x, 0). If P(x, 0) = 
5(x), describing a particle confined to an infinitesimal region about the origin, then P(x, t) = K(x, t) is the prob- 
ability distribution for finding the particle at x at time t. There are two aspects to K (x, t) which merit comment. 
The first is that the center of the distribution moves with velocity u. This is due to the presence of the external 
force. The second is that the standard deviation a = \j2Dt is increasing in time, so the distribution is not only 
shifting its center but it is also getting broader as time evolves. This movement of the center and broadening are 
what we have called drift and diffusion, respectively 



8.10.6 Master Equation 

Another way to model stochastic processes is via the master equation, which was discussed in chapter 3. Recall 
that if P i (t) is the probability for a system to be in state | z ) at time t and W i: j is the transition rate from state \j) to 
state | i ), then 

dP 

3 

Consider now a model in which the states | n ) are labeled by nonnegative integers. Let a n denote the rate of 
transitions from | n ) — > | n + 1 ) and let f3 n denote the rate of transitions from | n ) — *■ | n — 1 ) . The master equation 
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then takes the form 17 

dP 

= <*n-lPn-l + /Wl P n+l ~ K + ^)^„ ■ (8-314) 

Let us assume we can write a n = Ka{n/K) and /3„ = K{3(n/K), where K 3> 1. We assume the distribution 
P n (t) has a time-dependent maximum at n = K<f>(t) and a width proportional to \J~K. We expand relative to this 
maximum, writing 

n = Kcj>(t) + VA£ , (8.315) 

and we define 

P n (t) = n(Z,t)- (8-316) 
We now rewrite the master equation in eqn. 8.314 in terms of i7(£, t). Since n is an independent variable, we set 

dn = K<pdt + VKd£ => df| n = -VF<£dt. (8.317) 

Therefore 

f = -^f + f- 

Next, we write, for any function f n , 

f n = Kf(ct> + K- l ' 2 e ) 

= km) + k 1 ' 2 i f(cf>) + 1 e 2 r (0) + ■ ■ ■ ■ 

Similarly, 

= Kf{4>) + A 1/<2 £ /'(<£) ± + | e f"(cf>) +... . 
Dividing both sides of eqn. 8.314 by \[K, we have 

-% * + K ~ l/2 d -§ = $-^ + K - 1/2 {^' -#)t^ + &a + fi^ + (?-<t)n} + .... (8.321) 

Equating terms of order K° yields the equation 

</> = a{4>) - 0(<f>) . (8.322) 
Equating terms of order K~ l l 2 yields the Fokker-Planck equation, 

^ = (^-")|(^) + Ma + /?)f|, (8-323) 
where a = aypit)) and (3 = /3(6(t)) are to be regarded as functions of t, i.e. a(t) and Pit). 

If in the limit t — > oo, eqn. 8.322 evolves to a stable fixed point <j>* , then the stationary solution of the Fokker-Planck 
eqn. 8.323, J7 eq (£) = n(£, t = oo) must satisfy 

(/3-a)^(C77 cq )+I(a + ^)^ = =► 77 cq (£) = -jL= e"^ 2 , (8.324) 



(8.319) 



(8.320) 



di ^ cqy 2v ry oe cqv ' V2 



1T<J* 



where 

= f>m-°m ■ (8325) 



17 We further demand /3 „ = and P_ 1 (i) = at all times. 
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8.11 Appendix I : Boltzmann Equation and Collisional Invariants 

Problem : The linearized Boltzmann operator Lip is a complicated functional. Suppose we replace L by C, where 



„ , , / \ ( m \ 3 ^ 2 f , / mu 2 



/ TOM 2 




( mv 2 




\2k B T 


1) 


\2k B T 


1) 



(8.326) 



Show that L shares all the important properties of L. What is the meaning of 7? Expand ip(v,t) in spherical 
harmonics and Sonine polynomials, 

i>(v, t)=J2 ar*»(t) S r £+l (x) x 1 ' 2 Yi(n), (8.327) 



rtrn 



2 



with x = mv 2 /2k B T, and thus express the action of the linearized Boltzmann operator algebraically on the expan- 
sion coefficients a r £ m (t). 

The Sonine polynomials S™ (x) are a complete, orthogonal set which are convenient to use in the calculation of 
transport coefficients. They are defined as 



T(a +n + 1) (-x) 

and satisfy the generalized orthogonality relation 



' — : 1 (a 4- m + 1 n — m ! m! 

m=0 v y v 7 



dze-^SSCaOSS'Cr) = £^±^±11 A, . (8 .329) 



Solution : The 'important properties' of L are that it annihilate the five collisional invariants, i.e. 1, v, and w 2 , and 
that all other eigenvalues are negative. That this is true for C can be verified by an explicit calculation. 

Plugging the conveniently parameterized form of ip(v, t) into £, we have 

00 

A/> = -lY, a ^m(t) S r e+ i(x) x 1 ' 2 Yi{h) + ^ £ a rlm {t)Jd Xl x\ /2 e-'i 

x /dni [l + 2x 1 / 2 a;l /2 n-^ + | (x - |) (x, - §)] ST ifo) x f / 2 F^(n x ) , 

where we've used 



(8.330) 



^x\' 2 , du = J'^x- 1/2 dx 1 . (8.331) 
to 1 V 2m 1 1 v ' 



Now recall ^(n) = T/JJ an d 



Y^(n)=-\—wi0e iv Y^{h) = J— cosd Y-An) = +\— sinfle -1 *' 

V 87T V 47T V 87T 

^1/2( X ) = 1 ^3/2 ( X ) = 1 ^1/2 0*0 = I _ ^ > 
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which allows us to write 



l = 4nY »(h)Y» (ni) 

47T 



nn 1 = - 



Y \h) ^(m) + Yl(n) Yf (hi) + Y\(h) rV(f»i) 



(8.332) 



(8.333) 



We can do the integrals by appealing to the orthogonality relations for the spherical harmonics and Sonine poly- 
nomials: 



dnY^{fi)Y l m , (n) = Su'Sr, 



dx e~ x x a S2 (x) Si (x) = T ^ + a + 1) 8, 



r(n + l) 



Integrating first over the direction vector hi, 



rlra 

2 7 



= X)°r^(*) Jdx^fe-^ Jdh, Y°(h)Y °* (h!) S° 1/2 (x) S° 1/2 ( Xl ) 

rlra q 

1 

+ lxV*x\' 2 J2 Y^Y^ih^S^S^ix,) 



rn' — — l 



+ lY (h)Y *(h 1 )S 1 1/2 (x)Sl /2 (x 1 ) 

we obtain the intermediate result 



Sl^x^YHhx) 



rlra 



^= a rtm{t) Jdx, x\ /2 Y °(h) S w 6 m0 S° 1/2 (x) S° 1/2 ( Xl ) 

rlra q 

1 

+ | x^x 1 / 2 ]T Y^,(h) 5a 6 mm , S° 3/2 (x) S^xJ 

ra' — — 1 

+ lY °(h)6 l0 6 m0 Sl /2 (x)Sl /2 (x 1 ) 



(8.334) 
(8.335) 



(8.336) 



(8.337) 



S i (x-i ) x-i ^ . 

Appealing now to the orthogonality of the Sonine polynomials, and recalling that 

r(f) = V5r , r(l) = l , T(z + 1) = zT(z) , (8.338) 

we integrate over x x . For the first term in brackets, we invoke the orthogonality relation with n = and a = \, 
giving r(|) = For the second bracketed term, we have n = but a = |, and we obtain = | T(|), 

while the third bracketed term involves leads to n = 1 and a = |, also yielding = | T(|). Thus, we obtain 
the simple and pleasing result 

W = -^ a rUt) S r i (s) il(n) (8.339) 



rlra 
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where the prime on the sum indicates that the set 

CI ={(0,0,0), (1,0,0), (0,1,1), (0,1,0), (0,1,-1)} (8.340) 

are to be excluded from the sum. But these are just the functions which correspond to the five collisional invariants! 
Thus, we learn that 

AeM = N rlm S r , (x) x 1 ' 2 Yi(h), (8.341) 

1+ 2 

is an eigenfunction of C with eigenvalue —7 if (r, £, rri) does not correspond to one of the five collisional invariants. 
In the latter case, the eigenvalue is zero. Thus, the algebraic action of C on the coefficients a r £ m is 



{Co) 



rim 



-7 arfm if (r, £, m) f CI 
= if (r, t, m) G CI 



The quantity r = 7 1 is the relaxation time. 
It is pretty obvious that C is self-adjoint, since 

(0|£VO = fePof(v)<Kv)C[il>(v)] 



-771 



2ttKT 



3/2 



d v exp — 



2KT 



(j)(v)ip(v) 



7n 



2nk p T 
x 4>(v) 



drv / d 3 u exp 



mu 



exp 



mv 



m 2 ( mu 

l+ kT U - V - 



2k B T J 1 V 2fc B T 

2 3\ / mv 2 3 
3 V 2fcT 2/ V2A~r ~ 2 



ip(u) 



(8.342) 



(8.343) 



where n is the bulk number density and f°(v) is the Maxwellian velocity distribution. 

8.12 Appendix II : Distributions and Functionals 

Let x £ K be a random variable, and P(x) a probability distribution for x. The average of any function <j){x) is then 



dx P(x) cf)(x) I dx P{x) 



(8.344) 



Let r)(t) be a random function of t, with rj{t) e M, and let P [77(f)] be the probability distribution functional for 77(f). 
Then if <P [77(f)] is a functional of 77(f), the average of <P is given by 



D77P [77(f)] # [77(f)] / /^77P [77(f)] 



(8.345) 



The expression J -D77 P[?7] ^[77] is afunctional integral. A functional integral is a continuum limit of a multivariable 
integral. Suppose 77(f) were defined on a set of f values t n — nr. A functional of 77(f) becomes a multivariable 
function of the values rj n = rj (f n ). The metric then becomes 



(8.346) 
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In fact, for our purposes we will not need to know any details about the functional measure Drj; we will finesse 
this delicate issue 18 . Consider the generating functional, 

Z[j(t)] = fDr]P[ri]exp( fdt J(t)r){t) \ . (8.347) 



It is clear that 

1 S n Z[J] 



Z[J] 5J(t 1 )---6J(t n ) 



= (v(h)---v(t n ))- (8-348) 

J(t)=0 



The function J(t) is an arbitrary source function. We differentiate with respect to it in order to find the ?7-field 
correlators. 

Let's compute the generating function for a class of distributions of the Gaussian form, 

P[rj\ = cxp ^ - -L jdt (r 2 if + rf) ^ (8.349) 



OO 

1 f did 



= ex P l -— J — (l+cV)|r)M| z | . (8.350) 

^ — OO ' 

Then Fourier transforming the source function J it), it is easy to see that 

(co i ^ 1 2 \ 

— OO 

Note that with 77(A) S K and J(t) £ R we have 77* (a>) = rj(—u>) and J* (w) = J(— w). Transforming back to real time, 
18 A discussion of measure for functional integrals is found in R. P. Feynman and A. R. Hibbs, Quantum Mechanics and Path Integrals. 
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we have 



oo oo 



Z[J] = Z[0] ■ exp f i /a / dt' J(t) G(f - J(t') J , (8.352) 



-OO — OO 



where 

G(«) = f e ~ |s|/T , GM — -, T~2 2 ( 8 - 353 ) 
is the Green's function, in real and Fourier space. Note that 

so 

fds G(s) = G(( 

— C 

We can now compute 

<#ih(* 2 )> = G(*i-* 2 ) (8-355) 

(v(h)v(t 2 ) v(t 3 ) v(h)) = G(h - t 2 ) G(t 3 - i 4 ) + G(*! - t 3 ) G(t 2 - t 4 ) (8.356) 

+ G(t 1 -t 4 )G(* 2 -t 3 ). 

The generalization is now easy to prove, and is known as Wz'dc's theorem: 

contractions 

where the sum is over all distinct contractions of the sequence 1-2 • ■ • 2n into products of pairs. How many terms 
are there? Some simple combinatorics answers this question. Choose the index 1. There are (2n — 1) other time 
indices with which it can be contracted. Now choose another index. There are (2n — 3) indices with which that 
index can be contracted. And so on. We thus obtain 

„. . # of contractions ,„ . ,„ „. „ (2n)\ ,„ „^„s 

C M= ofl-2-3...2n =(2n-D(2»-3)-3.1 = i^ f . (8.358) 

8.13 Appendix III : General Linear Autonomous Inhomogeneous ODEs 

We can also solve general autonomous linear inhomogeneous ODEs of the form 



a 



1 — + a x = t(t). (8.359) 



dt n n ~ 1 dt™- 1 dt 
We can write this as 

A x(t) = £(f) , (8.360) 

where C t is the n th order differential operator 

d n d n ~ l d 

Ct = dF> + d^ + --- + a 'dt + a °- (8361) 

The general solution to the inhomogeneous equation is given by 

oo 

x(t) = x h {t) + [ dt' G(t, t') £(t') , (8.362) 
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where G(t, t') is the Green's function. Note that C t x h (t) = 0. Thus, in order for eqns. 8.360 and 8.362 to be true, 
we must have 

this vanishes 

A CO 

A x{t) = C t x h (t) + J dt' C t G(t, t') ^(f) = ffl , (8.363) 

— CO 

which means that 

C t G{t,t') = 5{t-t') , (8.364) 

where 5(t — t')is the Dirac 5-i unction. 

If the differential equation C t x(t) = £(i) is defined over some finite or semi-infinite t interval with prescribed 
boundary conditions on x(t) at the endpoints, then G(t, t') will depend on t and t' separately. For the case we are 
now considering, let the interval be the entire real line t E (— oo, oo). Then G(t, t') = G(t — t') is a function of the 
single variable t — t'. 

Note that C t = C(4A may be considered a function of the differential operator A. If we now Fourier transform 
the equation C t x(t) = we obtain 



— oo — oo 

OO f 

= fdt e wt \ {-ioj) n + a n _ t {-i^y- 1 + . . . + Oi (-iw) + a } x(t) 



(8.365) 



- OC- 



Thus, if we define 

n 

C{u>) = ^a fc Htj) fe , (8.366) 

k=0 

then we have 

£(lu) x(lo) = i(u) , (8.367) 

where a n = 1. According to the Fundamental Theorem of Algebra, the n th degree polynomial C(to) may be 
uniquely factored over the complex u> plane into a product over n roots: 

£{uj) = {-i) n (a - wi)(u> - cj 2 ) • ■ • - w„) . (8.368) 

If the {a k } are all real, then [£(w)] * = C(—u>*), hence if fl is a root then so is —Q*. Thus, the roots appear in pairs 
which are symmetric about the imaginary axis. I.e. if fl = a + ib is a root, then so is — fl* = —a + ib. 

The general solution to the homogeneous equation is 

n 

x h (t) = J2A < ,e- i "° t , (8.369) 

cr=l 

which involves n arbitrary complex constants A;. The susceptibility, or Green's function in Fourier space, G(ui) is 
then 

&(u>) = = 7 w r , (8-370) 

Note that [G(w)] = G{— u>), which is equivalent to the statement that G(t — t') is a real function of its argument. 
The general solution to the inhomogeneous equation is then 

CO 

x(t)=x h (t)+ [d? G{t-t')£(t') , (8.371) 
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where x h (t) is the solution to the homogeneous equation, i.e. with zero forcing, and where 

oo 

G(t-t') = f -e-^'-^GH 

./ 27T 



-oo 
oo 



I 



^ e ~ Mt ~° (8.372) 

2lT (uj — U!i)(uj — UJ2) ■ ■ ■ (oj — u> n ) 



™ e -;ov(t-t') 

= E^--— e(t-0, 
CT= 1 

where we assume that Imw^ < for all cr. This guarantees causality - the response x(t) to the influence is 
nonzero only for t > t'. 

As an example, consider the familiar case 

C(uj) = —lu 2 — ijuj + ujq 

= -(u) - w + ) (lj - oj_) , (8.373) 



with w ± = — 17 ± 0, and /3 — JuJq — I7 2 . This yields 

C! ( w± ) = qp(w+ - w- ) = qF2/9 . (8.374) 

Then according to equation 8.372, 

iC'{w + ) iC'(u_)\ 

p-7«/2 p -*/9« „i/3s ~| (8.375) 

e( s ) 



— 2i/3 2i(j 
= fi- 1 e- ls/2 sin(/3s)6(.s) . 

Now let us evaluate the two-point correlation function (x(t) x(t')), assuming the noise is correlated according to 
(£( s ) £( s ')) = <M S — s ')- We assume i, t' — > oo so the transient contribution cc h is negligible. We then have 



oo oo 



(x(t)x(t'))= jds I ds' G(t - s) G(t' - s') (£(s) £(«')) 

(8.376) 



27T 



Higher order ODEs 



Note that any n th order ODE, of the general form 
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may be represented by the first order system tp = V(<p). To see this, define (p k = d k 1 x/dt k 1 , with k = 1, . . . , n. 
Thus, for k < n we have ip k = <p k+1 , and <p n = F. In other words, 



d_ 

dt 



An inhomogeneous linear n th order ODE, 

d n x 



V(tp) 



9-> 



9n 



\<p n / \F(^..., Vp )J 



dt n 



+ -^ZT + -.-+a 1 — +a x^ f (t) 



may be written in matrix form, as 



dt 



(8.378) 



(8.379) 



d 


(<Pi\ 


/ 1 •■ 
1 


• ° \ 




M 


+ 





dt 














w 


\-a -a x -a 2 • • 


' -°>n-J 









(8.380) 



Thus, 

<P = Qv + £,, (8-381) 
and if the coefficients c k are time-independent, i.e. the ODE is autonomous. 

For the homogeneous case where £(t) = 0, the solution is obtained by exponentiating the constant matrix Qt: 

<p{t) = exp(Qi) ¥5(0) ; (8.382) 

the exponential of a matrix may be given meaning by its Taylor series expansion. If the ODE is not autonomous, 
then Q = Q(t) is time-dependent, and the solution is given by the path-ordered exponential, 



¥5 (t) = Pexp| Jdt' Q(t')\ <p(Q) , 



(8.383) 



where P is the path ordering operator which places earlier times to the right. As defined, the equation (p = V(<p) 
is autonomous, since the i-advance mapping g t depends only on t and on no other time variable. However, by 
extending the phase space May from M -> M x 1, which is of dimension n + 1, one can describe arbitrary 
time-dependent ODEs. 

In general, path ordered exponentials are difficult to compute analytically. We will henceforth consider the au- 
tonomous case where Q is a constant matrix in time. We will assume the matrix Q is real, but other than that it 
has no helpful symmetries. We can however decompose it into left and right eigenvectors: 



n 

Qij = ^ Vg R <j,i L a,j 

(7=1 



Or, in bra-ket notation, Q = J2 a v <j I^O (L a \. The normalization condition we use is 

{L a \R a ,) = S aa ,, 



(8.384) 



(8.385) 
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where {u a } are the eigenvalues of Q. The eigenvalues may be real or imaginary. Since the characteristic poly- 
nomial P{v) = det {yl — Q) has real coefficients, we know that the eigenvalues of Q are either real or come in 
complex conjugate pairs. 

Consider, for example, the n = 2 system we studied earlier. Then 

Q = (_l , _M . (8.386) 



The eigenvalues are as before: v ± = — 57 ± J I7 2 — cjq . The left and right eigenvectors are 



± 

The utility of working in a left-right eigenbasis is apparent once we reflect upon the result 

n 

f(Q) = J2f(^)\^)( L -\ ( 8 - 388 ) 

<T = 1 

for any function /. Thus, the solution to the general autonomous homogeneous case is 

n 

\<p(t)) = Y i e VJ \K)(L a \<P(0)) (8-389) 

n n 

<p&) = E ^ R *,i E L ^ ^(°) • ( 839 °) 

a=l j=l 

If Re (1^) < for all a, then the initial conditions <p(0) are forgotten on time scales r CT = v~ x . Physicality demands 
that this is the case. 

Now let's consider the inhomogeneous case where £(i) 7^ 0. We begin by recasting eqn. 8.381 in the form 

|(e-« t V )=e-« t €(t). (8.391) 

We can integrate this directly: 

t 

tp(t) = e Qt tp(0) + Jds e Q{t - s) £(s) . (8.392) 


In component notation, 

t 

n n „ 

<T=1 cr=l q 

Note that the first term on the RHS is the solution to the homogeneous equation, as must be the case when £(s) = 0. 

The solution in eqn. 8.393 holds for general Q and £(s). For the particular form of Q and £(s) in eqn. 8.380, we 
can proceed further. For starters, (L a \£(s)) = L a n £(s). We can further exploit a special feature of the Q matrix to 
analytically determine all its left and right eigenvectors. Applying Q to the right eigenvector \R a ), we obtain 

. , , 0* > 1) - (8.394) 



8.13. APPENDIX III: GENERAL LINEAR AUTONOMOUS INHOMOGENEOUS ODES 



415 



We are free to choose R al = 1 for all a and defer the issue of normalization to the derivation of the left eigenvec- 
tors. Thus, we obtain the pleasingly simple result, 

K,k =v>T 1 . (8.395) 
Applying Q to the left eigenvector (L a \, we obtain 

-«0 L <T,n = v a L aA (8.396) 

L^_ 1 -a j _ 1 L^ n = VtT L ad (j>l). (8.397) 
From these equations we may derive 

j k—l j n 

ft ^ ft 

The equality in the above equation is derived using the result P{v a ) = Y^j=o a j v i = Recall also that a n = 1. 
We now impose the normalization condition, 

2 L aih R atk = l. (8.399) 
fc=i« 

This condition determines our last remaining unknown quantity (for a given a), L a : 

n 

( L a | R a > = L CT> „ £ fc a fc = P\v a ) L^ n , (8.400) 

fe=l 

where P'(f) is the first derivative of the characteristic polynomial. Thus, we obtain another neat result, 

K, n = • (8.401) 

Now let us evaluate the general two-point correlation function, 

C..,(t,i') = (^(t) ^.,(f )) - (^-(t)) (p 3 ,(f )) • (8.402) 

We write 

oo 

<£(*) = <K* *') = \% fa) e"^ (s - s,) . (8.403) 

— oo 

When 4>{uj) is constant, we have (£(s)£(s')) = (j)(t)5(s — s'). This is the case of so-called white noise, when all 
frequencies contribute equally. The more general case when 4>(u>) is frequency-dependent is known as colored 
noise. Appealing to eqn. 8.393, we have 

-i "-1 * *' 

C„,(t,0 = V -2 /ds e^ (t " s) /ds' e^' ( *'- s,) 0(s - s') (8.404) 

^ P'(i/„) P'fi/,) J J 



o 

oo 



^r 1 0H( e — *- e ^)(e^'- e ^ t ') 

P'(^) P'^,) ( W - iu a )( u + iu a ,) 



In the limit t, t' — > oo, assuming Re (i^) < for all er (i.e. no diffusion), the exponentials e v " t and e Va>t may be 
neglected, and we then have 

c n , {t X) = Y. — ^— It- *" )e """" ,> ■ (MK) 
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8.14 Appendix IV : Correlations in the Langevin formalism 

As shown above, integrating the Langevin equation p + *yp = F + r/(t) yields 



t 

p(t) = p(0) + - (1 - e-* 1 *) + (da r/(s) e 7(s -*> . (8.407) 



n 



7 

Thus, the momentum autocorrelator is 

t t' 

<p(t)p(0) - (p(t))(p(t')) = Jds Jds> e ^-<> e^'" 4 ') (r,(s) V (s')) 

o o 



(8.408) 



r e -^ t+t "> ( ds e 2 ^ = Mk B T (V^^'l - e -^ t+ ''>) 



where 



t min = mm(t,t') = { (8.409) 
\ t if t < t 



is the lesser of t and i'. Here we have used the result 



Jds Jds' e T( s+s ') 5(a - s') = Jds J ds' e 7(s+s,) S(s - s') 



*min 





(8.410) 



One way to intuitively understand this result is as follows. The double integral over s and s' is over a rectangle 
of dimensions t x t'. Since the S-i unction can only be satisfied when a = s', there can be no contribution to the 
integral from regions where s > t' or a' > t. Thus, the only contributions can arise from integration over the 
square of dimensions < min x t min . Note also 

t + t' -2min(M') = \t-t'\ . (8.411) 

Let's now compute the position x(t). We have 

t 

x(t)=x(0) + ^jdsp(s) 



t r -i t S 

J ds ( w(o) -^) e ~ 7s+ ^7 +hhh^ eiis '- 8) (s - 4i2) 



x(0) + Ids 







(x(t)) + ± [ds fds lV ( Sl )e^- s) , 
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Figure 8.11: Regions for some of the double integrals encountered in the text. 



with v = p/M. Since (r](t)') = 0, we have 



(x(t)) = x(0) + jds 



v(0) - e" 7S + — 



(8.413) 



-ft) 



Note that for jt <C 1 we have (x(t)) = x(0) + v(0) t + \M~ x Ft 2 + 0(t 3 ), as is appropriate for ballistic particles 
moving under the influence of a constant force. This long time limit of course agrees with our earlier evaluation 
for the terminal velocity, = (p(oo)\ /M = F/jM. 

We next compute the position autocorrelation: 



t t' 



(x{t)x{t'))-{x{t)){x{t')) = ^ jds (ds'e-^ + ^ fd Sl jd S [e^+^( V ( Sl ) V (s 2 )) 





t t' 







r 



2 7 M 5 



jds Ids' ( e -rl— 'I _ e -r(«+»'>) 



(8.414) 



We have to be careful in computing the double integral of the first term in brackets on the RHS. We can assume, 
without loss of generality, that t > t'. Then 



t t' 



t t' 

Jds Jds' e-^ s ~ s '\ = Jds' ei"' Jds e" 7S + Jds 1 e" 18 ' Jds 

s' 



(8.415) 



2^H' + j- 2 (e-'" +e 



-ft' 



1 - e 



-f(t-t') 



)• 



We then find, for t > t' , 



91- T h T 

(x(t) x(t')) - (x(t))(x(t')) = thjLt' + ^L (2e-* + 2e-* - 2 - e"^"' > - e"^+' >) . 



jM 7 2 M 



(8.416) 
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In particular, the equal time autocorrelator is 

( X 2 (t)) - ( X (t)) 2 = ^ * + ^ (46-r* - 3 - e-^) . (8.417) 

We see that for long times 

(x 2 (t)) - (x{t)f ~ 2Dt , (8.418) 
where D = k B T/~/M is the diffusion constant. 



8.15 Appendix V : Kramers-Kronig Relations 

Suppose x{lo) = G(to) is analytic in the UHP 19 . Then for all v, we must have 

oo 

[P = , (8.419) 

J Air v — u> + it 

— oo 

where e is a positive infinitesimal. The reason is simple: just close the contour in the UHP, assuming x(u>) vanishes 
sufficiently rapidly that Jordan's lemma can be applied. Clearly this is an extremely weak restriction on x(u>), 
given the fact that the denominator already causes the integrand to vanish as |o;| _1 . 

Let us examine the function 

\ v — uj ie ,„ ,„„ N 

= 7 ^ 9 - 7 « 9 ■ ( 8 - 42 °) 

v — u) + ie (y — u)) z + e z [y — lo) z + e z 

which we have separated into real and imaginary parts. Under an integral sign, the first term, in the limit e — > 0, 
is equivalent to taking a principal part of the integral. That is, for any function F(v) which is regular at v = u>, 

oo oo 

fdv v — to _, . fdv F(v) .„ 
llm n / T- 7 vTT^ F M = P / 7T- ■ (8-421) 

— oo — oo 

The principal part symbol p means that the singularity at v = oj is elided, either by smoothing out the function 
\j{y — e) as above, or by simply cutting out a region of integration of width e on either side of v = u>. 

The imaginary part is more interesting. Let us write 

h(u) = . (8.422) 

For |u| 3> e, h(u) ~ e/u 2 , which vanishes as e — > 0. For u = 0, /i(0) = 1/e which diverges as e — > 0. Thus, has 
a huge peak at u = and rapidly decays to as one moves off the peak in either direction a distance greater that 
e. Finally, note that 

OC 

duh(u) = ir, (8.423) 

a result which itself is easy to show using contour integration. Putting it all together, this tells us that 

lim 6 = ttS(u) . (8.424) 



9 In this section, we use the notation x( w ) for the susceptibility, rather than G(u) 
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Thus, for positive infinitesimal e, 



1 P -r- ■ X( \ 

— — =p iiroyu) 



u±ie u 



(8.425) 



a most useful result. 

We now return to our initial result 8.419, and we separate x(lo) into real and imaginary parts: 

xM = x M + «x"M • 



(8.426) 



(In this equation, the primes do not indicate differentiation with respect to argument.) We therefore have, for 
every real value of ui, 

oo 



X ) + ix" (y) in6(v-u) 



Taking the real and imaginary parts of this equation, we derive the Kramers-Kronig relations: 

oo 



(8.427) 



7T f — CJ 



X'{v) 



7T V — OJ 



(8.428) 
(8.429) 



